EDITORIAL


Our science, our society 


We live in a scientific golden age. Never has the pace of discovery been so rapid, the range
of achievements so broad, and the changing nature of our understanding so revolutionary.
Science today has extraordinary powers. It reveals fundamental phenomena of our universe,
catalyzes new technologies, powers new businesses, fosters new industries, and improves
lives. This month’s annual meeting of the American Association for the Advancement of
Science (AAAS, the publisher of Science) in Austin, Texas, celebrates this golden age. More
than 10,000 scientists, students, teachers, business people, journalists, philanthropists,
science enthusiasts, government officials, and others will come together to hear and discuss
talks by experts on such topics as immuno-oncology, exoplanets, election polling, and much
more. Today’s advances and innovations presage a future that most of us have not yet imagined.

Lamentably, we also live in a new heyday of anti-science activism. Fake news and “alternative
facts” abound. Climate-change deniers occupy political office and determine environmental
policy. Fears of unsubstantiated dangers delay the deployment of genetically modified foods
in starving nations. The risks of nuclear power are overstated rather than carefully weighed.
The anti-vaccination movement endures, and there are claims that science is as culturally
determined and subjective as any other endeavor. Public figures cynically dismiss scientific
findings, fostering a popular distrust of expertise and experts. All this, too, presages a
different future that most of us would not want to imagine.

In this environment, how can we ensure that science prevails and continues to flourish? What
can be done to get the most from this scientific golden age? We can start by recognizing the
critical role of institutions in nurturing the scientific enterprise. All too often science is
viewed in terms of individual achievements: what someone did to win the Nobel Prize or a
MacArthur “genius” award; what someone else did to achieve tenure or to launch a
billion-dollar business. This isn’t surprising. The institutions that support scientific
inquiry (universities, research centers, federal funding agencies, and private philanthropies)
are designed to foster individual achievements, amplify individual abilities, and protect
individual efforts. Stories of discovery and success tend to focus on individuals and
individual accomplishments. But achieving success in science is a team sport, and a nation’s
institutions make it possible for its scientists to play. For example, in the United States,
embedding much of the scientific enterprise into institutions of higher education has
catalyzed productive collaborations between new and seasoned scholars, and between the
discovery and the transmission of knowledge.

When the focus of science is placed on individual achievement, it can neglect the importance
of the institutions that make the work of science possible. That leaves our institutions open
to attack. And, indeed, both science and its institutions are under attack today, with rampant
skepticism about the utility of the research enterprise and higher education. Also under
attack are the core principles that unite scientists and science enthusiasts: that objective
reality can be discovered; that anyone can compete in a game governed by ideas; that
disagreements are best resolved by assembling facts to test competing views; and that science
and the application of scientific principles have the capacity to improve lives. What’s more,
science’s universal truths call together people from any background, any nation, any phenotype
or genotype. These principles have guided us for centuries along the road to discovery and
understanding.

The very institutions that support individual inquiry also guard democratic principles and
foster human advance. They convene people with shared purpose and amplify their impact. It is
easy to assume that these institutions can stand on their own, but they cannot. None of
science’s successes is solely “mine” or “yours.” They are all “ours,” and it is our shared
responsibility to actively defend the institutions that enable them.

-Susan Hockfield

Susan Hockfield is president of AAAS and president emerita of the Massachusetts Institute of
Technology, Cambridge, MA, USA. hockfield@mit.edu

“I feel like a chump.”


Jake McDonald, an environmental health researcher in Albuquerque, New Mexico, 
to The New York Times, on learning that diesel-fume tests he conducted 
on monkeys used Volkswagens rigged to produce artificially low emissions. 
China to protect wetlands




China’s decision to regulate construction may help stem the loss of habitat for migrating birds. 


China plans to dramatically curb commercial development of
coastal wetlands, habitat that is crucial for almost 500 spe- 
cies of migratory birds. “It is massive good news,” says Nicola 
Crockford of the Royal Society for the Protection of Birds, 
based in Sandy, U.K. Over the past 5 decades, building of sea 
walls and construction on reclaimed wetlands have destroyed 
more than half of the tidal mudflats in China (Science, 9 October 2015, 
p. 150). To help stop the losses, the State Oceanic Administration 
(SOA) announced on 17 January that authorities will halt unauthor- 
ized projects and tear down illegal structures. SOA will approve 
coastal wetland development only if it is important for a few specific 
purposes, such as national defense. Although scientists and conserva- 
tionists would like additional protections, including national legisla- 
tion with tough penalties, they note that the central government has 
been cracking down on offenders and designating new reserves. 
Funding boost for CRISPR


BIOMEDICINE | The U.S. National 
Institutes of Health (NIH) will put
$190 million over 6 years into developing
gene-editing tools such as CRISPR
to treat diseases caused by DNA mutations.
Although companies are pouring money 
into CRISPR for new therapies, the NIH 
funding is meant to solve challenges
still facing such editing. Among these are
improving safety tests and successfully 
delivering CRISPR’s components into cells. 
The NIH research will be limited to work 
on somatic cells, not sperm or eggs, which 
means any DNA edits could not be passed 
to subsequent generations. The money 
comes from NIH’s Common Fund,
which supports research that cuts across
NIH’s 27 institutes and centers.


Safer cigarette claim rejected 


TOBACCO REGULATION | Philip Morris 
International failed last week to convince
a U.S. federal advisory panel that its new
smokeless cigarette could reduce the risk of 
harm from smoking. The company’s iQOS 
technology, already available in 30 coun- 
tries, releases nicotine by heating tobacco 
without burning it. The firm had hoped 
that in addition to a still-pending approval 
for the product from the U.S. Food and 
Drug Administration (FDA), it might get 
the agency’s blessing to market the innova- 
tion as safer than conventional cigarettes. 
But FDA’s Tobacco Products Scientific 
Advisory Committee, which consists largely 
of public health experts from academia, 
has a history of challenging the tobacco 
industry. The committee agreed
that the iQOS system would reduce users’
exposure to harmful chemicals compared 
with conventional cigarettes. But it decided 
that Philip Morris hadn’t proved its
claim that smokers can reduce their risk
of tobacco-related diseases by making the
switch. FDA can disregard the advisory 
panel’s recommendations but rarely does. 


New leader for French science 


RESEARCH POLICY | Computer scientist
Antoine Petit, 57, was named president of 
the €3.2 billion CNRS, France’s national 
research agency, on 24 January. Petit,
who previously was the president of the
National Institute for Computer Science
and Applied Mathematics, says he wants
to make Europe’s largest research organi- 
zation more “agile and reactive” and offer 
its young staff researchers higher salaries. 
CNRS’s interim president, Anne Peyroche, a 
cell and molecular biologist, left the agency 
prematurely earlier in January. The French 
research ministry declined to say why 
exactly, but her departure may be related 
to allegations of image manipulation in her 
papers. A spokesperson for the Alternative 
Energies and Atomic Energy Commission, 
Peyroche’s main employer, says the agency 
is investigating those accusations, which 
were made on the website PubPeer. 


A photo album made with DNA 


COMPUTER SCIENCE | Think of it as the 
world’s smallest time capsule. Researchers 
at the University of Washington (UW) in 
Seattle announced last week that they 
have begun collecting 10,000 photos that 
they will digitize and encode as strands
of synthetic DNA for future generations
to access. The effort, known as the
#MemoriesInDNA Project, is expected
to be the largest ever to store digital data
in DNA. Two years ago, the same group 
reported that it had recorded 200 mega- 
bytes of data, including 100 books and a 
Miles Davis jazz recording, using 2 billion 
nucleic acid bases—the adenines, thy- 
mines, cytosines, and guanines that make 
up DNA. The 10,000 photos will top that, 
depositing a gigabyte of data into tens of 
billions of bases. The point isn’t just to 
increase the storage volume but also to 
help researchers look for new ways
to search DNA-encoded information—such 
as finding all the images showing a red 
car—without first decoding each image and 
converting it into a digital format, says the 
group’s leader, UW computer scientist Luis 
Ceze. To submit a photo of your own to
be incorporated, visit the group’s webpage 
at memoriesindna.com. 


FDA axes monkey study 


ANIMAL RESEARCH | The U.S. Food and 
Drug Administration (FDA) in Silver 
Spring, Maryland, last week pulled the 
plug on a monkey study at its National 
Center for Toxicological Research because 
of animal welfare concerns and announced 
a broad review of the agency’s animal 
research programs. The study, which 
observed the effects of nicotine withdrawal 
in young squirrel monkeys to better 
understand addiction in adolescents and 
young adults, drew widespread attention
after primatologist Jane Goodall wrote
to FDA Commissioner Scott Gottlieb last
September condemning it as unethical. 
FDA suspended the research pending an 
independent review. Research advocates, 
fearing the agency had unduly buckled
to pressure from animal rights activists,
demanded more justification for the 
decision. Last week’s announcement 
explains that the suspension was prompted 
by the deaths of four monkeys; FDA said 
it found problems with the care and 
oversight provided by a “third-party 
animal welfare contractor.” In a statement, 
Gottlieb announced plans for an 
independent, third-party investigation of
the agency’s animal research programs,
which include approximately 270 non- 
human primates. And he described a newly 
created oversight body within FDA, the 
Animal Welfare Council. Unlike existing, 
federally mandated committees that 
oversee individual studies, the council
will track animal research across FDA and
report to its Office of the Chief Scientist. 


Rector resigns in Ecuador 


RESEARCH UNIVERSITIES | The divisive
head of a university that founders hoped
would become a research powerhouse in the
Andes has resigned. Mathematical biologist
Carlos Castillo-Chavez says he planned
to serve out his 2-year term as rector of
Yachay Tech University in Ecuador’s Urcuqui
province but quit 11 months early to support
his daughter through a medical crisis. The
move comes 6 months after several scientists
in leadership positions were fired abruptly,
leading to accusations that Castillo-Chavez
was undermining the university’s mission
(Science, 28 July 2017, p. 340).
Castillo-Chavez calls the changes “necessary reforms”
and says that Yachay Tech has “advanced
a lot since I arrived.” But Camilo Montes,
a geologist who left last month, says, “The
situation is just getting more and more
unstable. ... The future of Yachay is done,
with or without [Castillo-Chavez].”


Former astronaut picked to lead U.S. Geological Survey


President Donald Trump plans to nominate James Reilly, a former NASA astronaut
and exploration geologist, to lead the U.S. Geological Survey, the White
House announced last week. If confirmed, the 63-year-old would lead a $1.1
billion science agency whose researchers monitor for earthquakes and volcanic
eruptions, among other duties. Prior to NASA, Reilly worked for Enserch
Exploration, an oil and gas company based in Dallas, Texas. Reilly, who holds a
geoscience Ph.D. from the University of Texas in Dallas, retired from NASA in 2008
and currently serves as a technical adviser on space operations at the U.S. Air Force’s
National Security Space Institute in Colorado Springs, Colorado.


RENEWABLE ENERGY
AI saves birds from turbines


At a Wyoming wind farm, an energy company will install a new kind of sensor that
shuts down turbines when eagles are flying near to prevent collisions, the company
announced last week. Wind farms each year kill thousands of birds of many
species, including federally protected ones. The fatalities have aroused public
opposition to the spread of this renewable energy source. Duke Energy, based
in Charlotte, North Carolina, acted after pleading guilty in 2013 to criminal charges
that turbines at two of its Wyoming wind farms killed more than 150 migratory birds of
various species. Duke will deploy 24 bird detection devices at one of those farms, which
has 110 turbines. The bird detection devices, made by IdentiFlight, based in Louisville,
Colorado, use pole-mounted cameras and artificial intelligence to identify eagles and
monitor their flight paths. When an eagle flies too close to a turbine, the networked
devices will cause it to shut down within about 30 seconds.


Sensors can switch off turbines before birds collide with them.


NIH requires ‘life span’ trials 


CLINICAL RESEARCH | The U.S. National 
Institutes of Health (NIH) has expanded an 
existing rule that requires children’s partic- 
ipation in clinical trials to embrace people 
across the life span, including the oldest 
Americans. NIH announced last week that, 
beginning in January 2019, applicants 
seeking funds for trials must present a plan 
for including subjects of all ages—or justify 
their exclusion, which must be for scientific 
or ethical reasons. The change will also 
compel researchers to report to NIH the 
age of every enrolled subject, numbers the 
agency will make public. The changes were 
mandated by a 2016 law, the 21st Century 
Cures Act. NIH has required participation 
of children in clinical studies since 1998, 
but advocates for pediatric research have 
been frustrated in their attempts to learn 
whether investigators were complying. 
Advocates for research on older Americans 
have also bemoaned data showing that 
they are too often underrepresented
in clinical trials.


NASA's GOLD hitches a ride 


EARTH SCIENCE | The first NASA science
mission to be hosted on a commercial 
satellite was launched on 25 January on 
an Ariane 5 rocket. The Global-scale 
Observations of the Limb and Disk 
(GOLD) project is tagging along aboard
a geostationary communications
satellite named SES-14, rather than on a
government satellite or the International 
Space Station. It’s an economical approach 
NASA plans to use again, including for its 
Geostationary Carbon Cycle Observatory, 
which will be launched in several years 
on another commercial satellite host.
The $55 million GOLD project is designed
to explore phenomena at the boundary 
between the upper reaches of Earth’s 
atmosphere and space. 
SPACE EXPLORATION


India plans to land near moon’s south pole


Chandrayaan-2 orbiter could slake scientists’ thirst for detailed data on lunar water 


By Pallava Bagla, in Bengaluru, India 


Sometime this summer, a spacecraft orbiting
over the moon’s far side, out of
contact with controllers on Earth, will 
release a lander. The craft will ease to 
a soft landing just after lunar sunrise 
on an ancient, table-flat plain about 
600 kilometers from the south pole. There, 
it will unleash a rover into territory never 
before explored at the surface; all previous 
lunar craft have set down near the equator. 

That’s the ambitious vision for India’s 
second voyage to the moon in a decade, 
due to launch in the coming weeks. If 
Chandrayaan-2 is successful, it will pave 
the way for even more ambitious Indian 
missions, such as landings on Mars and 
an asteroid, as well as a Venus probe, says
Kailasavadivoo Sivan, chairman of the Indian 
Space Research Organisation (ISRO) here. 
Chandrayaan-2, he says, is meant to show 
that India has the technological prowess “to 
soft land on other heavenly bodies.” 

But lunar scientists have much at stake, 
too. “There has been a rebirth of lunar explo- 
ration across the globe, and India can’t be left 
behind,” says Mylswamy Annadurai, direc- 
tor of the ISRO Satellite Centre. Instruments 
aboard the lander and rover will collect data 
on the moon’s thin envelope of plasma, as well 
as isotopes such as helium-3, a potential fuel 
for future fusion energy reactors. The orbiter 
itself will follow up on a stunning discovery 
by India’s first lunar foray, the Chandrayaan-1 
orbiter, which found water molecules on the 
moon in 2009. Before that, “It was kind of 
a kooky science to think that you’d find water”
there, says James Greenwood, a cosmochemist
at Wesleyan University in Middletown,
Connecticut. “Now, we’re arguing
about how much water, and not whether it
has water or not.” Cameras and a spectrometer
aboard the Chandrayaan-2 orbiter
could help settle that question. 

The $150 million mission was originally 
meant to fly 3 years ago, but Russia failed to 
deliver a promised lander, prompting India 
to go it alone. Final preparations are under- 
way on the Chandrayaan-2 spacecraft, which 
will launch from the Sriharikota spaceport 
on the Bay of Bengal aboard India’s Geo- 
synchronous Satellite Launch Vehicle. 


SCIENCE sciencemag.org 


A landing so far from the lunar equator 
is especially tricky. “It is a difficult and com- 
plicated mission,” says Wu Ji, director of the 
National Space Science Center in Beijing. 
Less sunlight reaches the poles, which means 
the lander and rover must be parsimonious 
with power. The plan is to set down in a high 
plain between two craters, Manzinus C and 
Simpelius N, at a latitude of about 70° south. 

The lander will pack as much science as it 
can into its first lunar day—14 Earth days—as 
controllers may not be able to revive it after 
the long lunar night. The craft has a Lang- 
muir probe to measure the moon’s plasma—a 
wispy layer of charged ions that may ex- 
plain why the lunar regolith, or dust, has a 


tendency to float in the thin atmosphere. It 
also has a seismometer for recording moon- 
quakes. Its seismic measurements would 
supplement those from the Apollo landings, 
because readings from high latitudes would 
be sensitive to signals passing through dif- 
ferent parts of the moon. And if the seismo- 
meter is lucky enough to record a sizable 
quake during its operational lifetime, it 
might offer new evidence in a long-running 
debate over what the moon’s core is com- 
posed of, and whether it’s solid. “We just need 
more data to understand the lunar interior,”
says David Kring, a planetary geologist at the 
Lunar and Planetary Institute in Houston, 
Texas, who is not involved in the mission. 


Pole position


If all goes to plan, India’s Chandrayaan-2
mission this summer will attempt a soft
landing on an ancient high plain of the
moon, some 600 kilometers from the
south pole. It would be the first landing
so far from the equator.


Exploring lunar nova

The lander is equipped with a seismometer
to listen for moonquakes and a Langmuir
probe that will measure fluctuations in the
wispy plasma enveloping the lunar surface.


Seeking ground truth

With spectrometers for assaying
elements in the regolith, the briefcase-
size rover hopes to make the most
of the 14-Earth-day lunar day.


[Figure: lander diagram with its landing skid labeled, and a map of the moon showing Tycho,
the Sea of Serenity, the equator, and the Chandrayaan-2 landing site near the south pole.]


Published by AAAS
The briefcase-size rover, weighing just
25 kilograms, will also carry two spectrom- 
eters for probing the lunar surface’s elemen- 
tal composition. The area is enticing, as it is 
thought to be made up of rocks more than 
4 billion years old that solidified from the 
magma ocean that covered the newly formed 
moon. The data would be compared with 
those from Apollo-era missions that landed in 
other ancient highlands closer to the equator. 

For some scientists, the most anticipated 
data will come from the orbiter’s water map- 
per. Protons in the solar wind generate hy- 
droxyl ions when they strike oxides in the 
regolith. The ions drift to the poles, where 
they are trapped in craters as water ice, which 
the orbiter will inventory. Shedding light on 
the moon’s water circulation “is a worthwhile 
endeavor,” says Carle Pieters, a lunar scientist
at Brown University. Locating substantial wa- 
ter, adds Muthayya Vanitha, Chandrayaan-2’s 
project director at ISRO, “could pave the way 
for the future habitation of the moon,” as wa- 
ter is a limiting factor for operating a base. 

Regardless of whether Chandrayaan-2 
breaks new scientific ground, a success- 
ful soft landing near the south pole will be 
a technical accomplishment for India, as 
well as a proud moment for the country. It 
may even benefit other countries’ moon pro- 
grams. “One of NASA’s main priorities is to go 
[to the south pole] on a sample return mission,”
Greenwood says, “so this could help us
also later down the road as they give us more 
information as to what’s there.”


With reporting by Katie Langin. 


To simulate the moon’s anemic gravity during 
testing of its lunar rover, the Indian Space Research 
Organisation tethered a model to a helium balloon. 
Judge orders unmasking of anonymous peer reviewers


Bitter legal battle between two physical fitness groups 
produces unusual court decision 


By Andrew P. Han, Retraction Watch 


In what appears to be a first, a U.S. court
is forcing a journal publisher to breach 
its confidentiality policy and identify an 
article’s anonymous peer reviewers. 
The novel order, issued last month by 
a state judge in California, has alarmed 
some publishers, who fear it could deter sci- 
entists from agreeing to review draft manu- 
scripts. Legal experts say the case, involving 
two warring fitness enterprises, isn’t likely 
to unleash widespread unmasking. But 
some scientists are watching closely. 

The dispute revolves around a 2013 paper, 
since retracted, that appeared in The Jour- 
nal of Strength and Conditioning Research. 
In the study, researchers at The Ohio State 
University in Columbus evaluated physical 
and physiological changes in several dozen 
volunteers who participated for 10 weeks in 
a training regimen developed by CrossFit 
Inc. of Washington, D.C. Among other re- 
sults, they reported that 16% of participants 
dropped out because of injury. 

In public and in court, CrossFit has al- 
leged that the injury statistic is false. Cross- 
Fit also claims that the journal’s publisher, 
the National Strength and Conditioning 
Association (NSCA) of Colorado Springs,
Colorado—which is a competitor in the fit- 
ness business—intentionally skewed the 
study to damage CrossFit. NSCA in turn has 
countersued, accusing CrossFit executives 
of defamation. Amid the legal crossfire, 
the journal first corrected the paper to re- 
duce the number of injuries associated with 
CrossFit, then retracted it last year, citing 
changes to a study protocol that were not 
first approved by a university review board. 

CrossFit suspects the paper’s reviewers 
and editors worked to play up injuries as- 
sociated with its regimen, and it has asked 
both federal and state judges to force the 
publisher to unmask the reviewers. In 2014, 
a federal judge refused that request. But 
last month, Judge Joel Wohlfeil of the San 
Diego Superior Court in California, who is 
overseeing NSCA’s defamation suit against 
CrossFit, ordered the association to provide 
the names. 

That order is an outlier, attorneys famil- 
iar with such disputes say. They note that 
over the past 3 decades at least three other 
parties have unsuccessfully asked courts to 
unmask reviewers, going up against pub- 
lishers including the American Physical 
Society, Elsevier, and the Massachusetts 
Medical Society, which publishes The New
England Journal of Medicine (NEJM).


A paper reporting injuries associated with a popular
fitness regime has sparked a yearslong court battle.

Academic publishers can’t claim special 
protections for information provided in 
confidence, unlike journalists who can pro- 
tect confidential sources under state shield 
laws. But judges have generally agreed with 
journals that unmasking reviewers would 
do more harm than good. In one notable 
2007 case, pharma giant Pfizer subpoenaed 
NEJM for information about peer review- 
ers as part of a class action lawsuit related 
to the marketing of its painkillers Bextra 
and Celebrex. In an affidavit submitted on 
behalf of NEJM, Donald Kennedy, former 
president of Stanford University in Palo 
Alto, California, who was then the editor-in- 
chief of Science, stated that were Pfizer to 
win, “I have no doubt whatever that ... sci- 
entists would think twice about the next re- 
viewing assignment, and that many would 
decide not to help.” The judge quashed 
Pfizer’s request. 

Despite such high-profile decisions, jour- 
nals are constantly defending reviewer con- 
fidentiality, says Paul Shaw, an attorney at 
Verrill Dana in Boston who has represented 
NEJM for 17 years. Over that time, Shaw 
estimates he has fielded 30 to 35 subpoe- 
nas asking for information about a peer- 
reviewed article—usually one related to 
drugs or medical devices at the center of 
class action lawsuits. “Invariably,” he says, 
“one of the requests will be for the identity 
of and memoranda done by the peer re- 
viewers.” Usually, Shaw responds by send- 
ing a standard objection letter that puts the 
matter to rest. 

So why did the CrossFit case take a dif- 
ferent turn? One reason is that this time the 
publisher is the plaintiff, not the defendant, 
says Joshua Koltun, an attorney based in 
San Francisco, California, who has reviewed 
the case. Another is that the identity, con- 
duct, and motivations of the reviewers 
could be key to CrossFit’s defense. NSCA, he 
notes, “is saying: ‘I’m going to sue [CrossFit] 
for saying I committed fraud, but I’m not 
going to let you see info that might prove 
[CrossFit’s claim].’ You can’t have your cake
and eat it, too.” 

Shaw, for one, believes the case is so un- 
usual that it won’t have broad ripple effects. 
“I don’t think it has any precedential value
whatsoever,” he says. “For lack of a better le- 
gal descriptor, the facts seem very messy.” 


Andrew P. Han is a reporter for Retraction 
Watch based in New York City. This story 
is the product of a collaboration between 
Science and Retraction Watch. 
SPACE WEATHER


NASA seeks to revive lost probe that traced solar storms


Amateur found IMAGE while looking for spy satellite


By Paul Voosen 


After years of silence, a NASA satellite is phoning home. Thought lost
because of a power system failure, the Imager for Magnetopause-to-Aurora
Global Exploration (IMAGE) was discovered last month, still broadcasting,
by Scott Tilley, an amateur astronomer.
NASA may find it possible to revive the 
mission, which once provided unparalleled 
views of solar storms crashing into Earth’s 
magnetosphere. “I’ve discovered, or rather,
‘recovered,’ many satellites,” Tilley says.
“Nothing has resonated like this.” 

By day, Tilley designs marine power 
systems in Roberts Creek, Canada. But he 
spends his free time tracking 
the radio signals from spy sat- 


identifying itself as “2000-017A,” which he realized corresponded to IMAGE.
Launched in 2000 and left for dead in 2005, the $150 million mission was
broadcasting. It just needed someone to listen.

After Tilley blogged about the detection, word spread to former members of
IMAGE’s science team, including Patricia Reiff, a space plasma physicist at
Rice University in Houston, Texas. “The odds are extremely good that it’s
alive,” she says. There also appear to be data in the radio signal beyond
basic bookkeeping information, Reiff adds, a hint that some of the satellite’s
suite of six instruments are working.

Since Tilley’s announcement, IMAGE mission scientists have been engaged in
space archaeology, digging up command-and-control software backed up on
obsolete tape cartridges. As of press time, NASA was continuing to try to
communicate with the satellite using its deep space radio antennas. “The team
is collectively holding their breath waiting for some real information
exchange between IMAGE and the ground,” Reiff says.

Prior to its failure, IMAGE was already considered a successful mission. The
half-ton satellite served as a sort of sentry, providing a global view of
charged particles captured by the magnetosphere, the envelope around Earth
dominated by its magnetic field. IMAGE’s instruments detected the atoms kicked
out when particles in solar storms crashed into a thin veil of hydrogen atoms
cloaking the edge of Earth’s atmosphere, boosting understanding and
predictions of space weather. The observations have never been replaced, says
Dan Baker, director of the Laboratory for Atmospheric and Space Physics at the
University of


lite’s radio. It remained pos- 
sible, however, for IMAGE 
to reset itself. When Earth 
blocked its solar panels from the sun for 
an extended time, its batteries could drain. 
Upon recharging after such eclipses, the 
spacecraft could reboot—but when it failed 
to do so after a 2007 eclipse, NASA declared 
the mission over. Several of these eclipses 
have occurred in the past few years, and 
indeed, another astronomer, Cees Bassa at 
the Netherlands Institute for Radio Astron- 
omy in Dwingeloo, has now discovered in 
his records that IMAGE was broadcasting 
as far back as October 2016. 

If IMAGE is revived, it will be well po- 
sitioned in its orbit to monitor Earth’s 
northern auroral zone, a region that space 
weather can hit hard. It’s thrilling to think 
the spacecraft could be back, Reiff adds. Its 
rediscovery reminds her of the mission’s 
motto: “The real voyage of discovery is not 
in seeking new landscapes, but in having 
new eyes.” 
AGING RESEARCH


Forever young? Naked mole 
rats may know the secret 


A new study claims the strange rodents flout the 
mathematics of mortality—but not everyone is convinced 


By Kai Kupferschmidt, 
in San Francisco, California 


Over a lifetime of studying naked mole
rats, Rochelle Buffenstein has often 
been surprised, even shocked, by 
the extraordinary features of these 
freakish-looking rodents. But when 
she recently tallied the life histories 
of thousands of animals born in her lab, she 
reached the most startling conclusion yet: 
There may be no such thing as a lifetime 
for naked mole rats. As the animals grow 
older (and they can grow very old indeed) 
their risk of dying does not seem to in- 
crease, making the concept of a maximum 
life span meaningless, she wrote in a paper 
published in eLife last week. “To me this is
the most exciting data I’ve ever gotten,” says 
Buffenstein, who works at Calico, a Google 
biotech spinoff here that focuses on longev- 
ity. “It goes against everything we know in 
terms of mammalian biology.” 
Although researchers who study aging 
welcomed the trove of data in the paper,
many cautioned against sweeping conclu- 
sions. “I think it’s too early to say naked 
mole rats are nonaging animals,” says Joao 
Pedro De Magalhaes, a gerontologist at the 
University of Liverpool in the United Kingdom.
Still, “It is clear that something very
unusual is going on in these animals,” he
says. “This study further substantiates that 
naked mole rats truly could unlock keys to 
healthy aging,” says Ewan St. John Smith, 
a pharmacologist who studies naked mole 
rats at the University of Cambridge in the 
United Kingdom. 

Native to the Horn of Africa and Kenya, 
naked mole rats were already an outlier 
among mammalian species. Like some bees 
and almost all ants, the burrowing rodents 
are eusocial: Each colony has only a single 
breeding female. Their crumpled, sausage- 
shaped bodies are resistant to some types of 
pain, and they can survive 18 minutes with- 
out oxygen and rarely develop cancer. 

For naked mole rats, death is “stochastic,”
says Rochelle Buffenstein, who has studied
the species since 1980.

Few have been studying their peculiarities
longer than Buffenstein. Born
in Rhodesia—modern-day Zimbabwe—she 
started working with naked mole rats at 
the University of Cape Town in South Africa 
in 1980, where she studied with anatomist 
Jennifer Jarvis, who in 1981 first reported 
that the rodents are eusocial. On a field expe- 
dition to Kenya, the two caught many naked 
mole rats together, some of which became 
the seed of Buffenstein’s own colonies. 

The animals have accompanied her to 
professorships in Johannesburg, South 
Africa; New York City; and San Antonio, 
Texas, where she studied their vitamin D 
metabolism, thermoregulation, and the 
energetic cost of pregnancy, among other 
things. “She really has been a catalyst to 
the field of naked mole rat biology,” says 
St. John Smith, not least by helping and 
encouraging colleagues. “For example, she 
was willing to provide me with an intact 
colony of animals,” he says. 

Buffenstein now has some 3000 ani- 
mals in 147 colonies, more than any other 
researcher; they're housed in a basement 
an hour’s drive from her Calico office. The 
animals seem contented, noisily gnaw- 
ing with their long teeth at their Plexiglas 
housing, made up of cylinders and boxes 
that each colony variously uses for storage, 
as a toilet, or to huddle together to sleep 
in a huge pink mass. Buffenstein herself is 
happy at Calico as well, she says. “Here you 
are liberated from writing grants [and] you 
are all fighting for the same goal, which is 
to find a way to slow aging.” 

Researchers already knew naked mole 
rats can have very long lives. Mice in cap- 
tivity live at most 4 years; based on their 
size, naked mole rats would not be expected 
to live past 6. But by the time Buffenstein 
moved to New York City in 2001, many of 
her animals were more than 15 years old. “I 
thought: ‘Wow, they are living a long time; 
we should start understanding why.’” For the
new study, she pulled together the records of 
when each animal in her care was born and 
when it died, was killed for an experiment, 
or given away to other researchers. 

The data suggest that it’s not just their 
longevity that is unusual. All mamma- 
lian species were thought to adhere to the 
Gompertz law, a mathematical equation that 
describes aging published by U.K. math- 
ematician Benjamin Gompertz in 1825. He 
found that the risk of dying rises exponen- 
tially with age; in humans, for instance, it 
doubles roughly every 8 years after the age 
of 30. Scientists have subsequently shown 
that in most mammalian species, the law 
kicks in when they reach an age two to five
times their age of sexual maturity. 
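
The doubling rule can be made concrete with a short Python sketch. It assumes the simplest exponential form of the Gompertz law, in which risk relative to age 30 is 2 raised to (age − 30)/8, using the roughly 8-year doubling time quoted above. The function name and default parameters are illustrative assumptions, not values taken from any study.

```python
def gompertz_relative_risk(age, start_age=30.0, doubling_years=8.0):
    """Risk of dying at `age`, relative to the risk at `start_age`.

    Assumes pure exponential growth with a fixed doubling time, which is
    the simplest reading of the Gompertz law described in the text.
    """
    if age < start_age:
        raise ValueError("the doubling rule is quoted only for ages past start_age")
    return 2.0 ** ((age - start_age) / doubling_years)


if __name__ == "__main__":
    # Each additional 8 years doubles the relative risk.
    for age in (30, 38, 46, 62, 78):
        print(age, gompertz_relative_risk(age))
```

Under these assumptions the risk at 78 is 64 times the risk at 30, which is why an exponential law implies an effective ceiling on life span.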

But not naked mole rats. After they 
reached sexual maturity at 6 months of age, 
each animal’s daily chance of dying was 
a little more than one in 10,000, Buffen- 
stein found, and it stayed roughly the same, 
and even went down a little, through- 
out their lives—even after they reached 
25 times the age of sexual maturity. Dy- 
ing, for naked mole rats, is “stochastic,” 
Buffenstein says. “It’s like radioactive decay.” 
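
The radioactive decay comparison can be sketched in Python as well. The example assumes a constant daily risk of exactly 1 in 10,000 (the reported figure is “a little more than” that), so the fraction surviving n days is (1 − p) raised to n. The constant and function names are illustrative, and the sketch ignores animals that left the colony for other reasons.

```python
import math

# Assumed value: the article reports "a little more than one in 10,000".
DAILY_RISK = 1e-4
DAYS_PER_YEAR = 365.25


def survival_fraction(years, daily_risk=DAILY_RISK):
    """Fraction of a cohort still alive after `years` under a constant daily risk."""
    days = years * DAYS_PER_YEAR
    return (1.0 - daily_risk) ** days


def half_life_years(daily_risk=DAILY_RISK):
    """Time for half the cohort to die, the analog of a radioactive half-life."""
    days = math.log(0.5) / math.log(1.0 - daily_risk)
    return days / DAYS_PER_YEAR


if __name__ == "__main__":
    print(half_life_years())
    print(survival_fraction(30))
```

With a flat risk of this size the half-life comes out near 19 years, and about a third of a cohort would still be alive at 30. A Gompertz-style population would instead die off over a narrow range of ages.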

Caleb Finch, a biogerontologist at the 
University of Southern California in Los 
Angeles, agrees that the animals’ mortal- 
ity is “remarkably low.” “At advanced ages, 
their mortality rate remains lower than 
any other mammal that has been docu- 
mented,” he says. But more data on older 
naked mole rats are needed to be sure 
that their risk of dying really is flat, he 
says. Fewer than 50 in the study lived past 
15 years of age, because many of the ani- 
mals were killed or moved to other labs. 

Buffenstein says her data are enough to 
show that the animals don’t age. “If you 
look at any rodent aging study, 100 ani- 
mals is all you need to see Gompertz aging. 
Here we have 3000 data points and we’re 
not seeing it,” she says. “To me it says that 
these animals really have figured out a way 
to slow aging.” 

Still, the oldest animal currently liv- 
ing in Buffenstein’s lab is 35, and few are 
older than 30. Matthias Platzer, a biologist 
at the Leibniz Institute on Aging in Jena, 
Germany, says it’s still not clear what hap- 
pens when naked mole rats age beyond 
30 years. “Maybe aging happens really fast 
then? Even Rochelle Buffenstein does not 
have the data on this,” he says. 

The big question is how naked mole rats 
stay so young. Their low body temperature 
of about 32°C may help them avoid accu- 
mulating cellular and molecular damage, 
says Magalhaes, and there is evidence that, 
compared with other mammals, they have 
better DNA damage repair and are more 
efficient at getting rid of misfolded pro- 
teins. Buffenstein says she hopes to iden- 
tify a master switch controlling all these 
antiaging measures. “We’re pretty certain 
that with so many things being different 
there has to be something upstream regu- 
lating the whole process.” 

Although the naked mole rat is a spe- 
cial case, it could end up being the key to 
understanding aging in other mammals, 
including people, says Buffenstein, just as 
studying squid giant axons helped neuro- 
scientists fathom how nerve cells work. “I 
would argue,” she says, “that most of our 
biggest discoveries in biology have been 
made using freak animals.” 
Accelerator boom hones
China’s engineering expertise 


Bevy of particle accelerators could pave the way for future 
collider that would be world’s largest 


By Dennis Normile, in Dongguan, China 


Aiming a stream of protons at a target
to produce neutrons for experiments
is “so hard to do,” says Andrew Taylor,
former director of the ISIS Neutron
and Muon Source near Oxford, U.K.
Most accelerators generate electron 
beams; accelerating and controlling protons, 
which are 2000 times as heavy, is far more 
complex and requires higher magnetic fields. 
The targets themselves pose challenges, too. 
In raising the curtain on the China Spall- 
ation Neutron Source (CSNS) here, China 
has joined just four other nations in having 
mastered the technology. The $277 million 
facility, set to open to users this spring, is 
expected to yield big dividends in materials 
science, chemistry, and biology. 

More world-class machines are on the way.
China is starting construction on four more 
major accelerator facilities this year (see 
table, below). The building boom is prompt- 
ing a scramble to find enough engineers and 
technicians to finish the projects. But if they 
all come off as planned, the facilities collec- 
tively will put China “on par with or leading 
the world in the design, construction, and 
exploitation of accelerator-based research 
and applications,” says Wu-Tsung Weng, an
accelerator physicist at Brookhaven National
Laboratory in Upton, New York.

That would position China to tackle the 
next global megaproject: a giant accelerator 
that would pick up where Europe’s Large 
Hadron Collider (LHC) leaves off. With a 
$6 billion price tag and a planned circum- 
ference of at least 50 kilometers, the Circu- 
lar Electron Positron Collider (CEPC) would 
study the Higgs boson in detail. A possible fu- 
ture upgrade that would cost billions of dol- 
lars more, the Super Proton-Proton Collider, 
would search for physics beyond the stan- 
dard model at several times the LHC’s energy 
levels. The CEPC, which backers hope to see 
built by 2030, would “put China in a domi- 
nant position in fundamental physics in the 
world in the 21st century,” says Shing-Tung 
Yau, a mathematician at Harvard University. 
Many high-energy physicists in China and 
abroad think the CEPC is an obvious next 
step, though researchers in other disciplines 
worry it will monopolize available resources. 

Accelerating Chinese science

China is ramping up its capacity in accelerator physics with several machines nearing completion or coming online
in the next decade. Particle physicists hope the buildup will climax with the Circular Electron Positron Collider.

FACILITY (LOCATION)                 RESEARCH TARGET                   COST           STATUS/SCHEDULE

China Spallation Neutron            Materials science, physics,       $277 million   Opening to users in spring
Source (Dongguan)                   chemistry, and life sciences

Shanghai Soft X-ray Free-Electron   Materials science and             $110 million   Soft x-ray FEL opening to users
Laser (FEL), Hard X-ray FEL         life sciences                                    in mid-2019; hard x-ray FEL
(Shanghai)                                                                           online in 2025

High-Energy Photon Source           Materials science, chemistry,     $730 million   Mid-2025
(Beijing)                           and biomedicine

China Initiative Accelerator        Nuclear waste transmutation and   $280 million   2024
Driven System (Huizhou)             future energy technologies

High-Intensity Heavy ion            Atomic and nuclear physics        $240 million   2024
Accelerator Facility (Huizhou)

Circular Electron Positron          Particle physics                  $6 billion     Under study
Collider (site TBD)

The current construction binge reflects a
late start and pent-up demand. The Institute
of High Energy Physics (IHEP) in Beijing
inaugurated China’s first particle accelerator,
the Beijing Electron Positron Collider,
in 1989; it was followed 2 years later by the
Hefei Light Source, a facility designed to
produce synchrotron radiation for studies
of molecules and materials. The next major 
accelerator—another effort to catch up with 
other countries—was the Shanghai Synchro- 
tron Radiation Facility, completed in 2009. 
By then, China’s physicists were thinking 
bigger. Next on the wish list was a power- 
ful spallation neutron source. By the early 
2000s, the United Kingdom and Switzer- 
land had such machines in operation, and 
the United States, Japan, and a European 
consortium were either building or plan- 
ning comparable facilities. It was clear that 
“neutron scattering would be very important 
for many fields,” says Chen Hesheng, CSNS
project manager. Neutrons interact weakly 
with other particles, making them better 
than synchrotron x-rays at probing cells or 
substances without damaging them. Just 
as the CSNS was getting underway, China’s 
physicists got a windfall. In 2010, the Na- 
tional Development and Reform Commis- 
sion (NDRC), China’s primary funder of 
big science projects, agreed to build several 
more accelerators; four won funding under 
China’s 13th 5-year plan, starting in 2016. 
Technological development for the new fa- 
cilities is ambitious, Taylor says. A hard x-ray 
free-electron laser, being built by the Shang- 
hai Institute of Applied Physics (SINAP) and 
ShanghaiTech University, employs super- 
conducting cavities to accelerate electrons 
that shed x-rays. That approach, used in 
only three other facilities in the world, yields 
higher energies, says Wang Dong, an acceler- 
ator physicist at SINAP. A heavy-ion accelera- 
tor now rising at a new complex in Huizhou, 
China, will have a higher beam intensity than 
comparable machines elsewhere, providing 
advantages in studying atomic and nuclear 
structures and the origin of heavy elements, 
explains Chen Xurong, a nuclear physicist 
heading the project for the Institute of Mod- 
ern Physics in Lanzhou, China. The Huizhou 
complex will also host an accelerator-driven 
system that will experiment in nuclear waste 
transmutation and energy production. 


Groundbreaking is slated this year for 
what would become one of the world’s bright- 
est x-ray sources, IHEP’s High-Energy Photon 
Source in Beijing. It owes its intense bright- 
ness to a beam bending angle that cuts en- 
ergy losses, says project manager Qin Qing. 
It, too, relies on superconducting cavities. 

The CEPC likely would rely on similar 
cavities, says Lou Xinchou, an IHEP deputy 
director in charge of planning the mega- 
accelerator. The High-Energy Photon Source, 
he notes, also requires know-how in cooling 
and control technologies that will pay off for 
the CEPC. 

IHEP is now working on a conceptual de- 
sign for the CEPC, due out in April. But some 
physicists wonder whether the scientific pay- 
off will be worth the investment, even though 
China anticipates that international partners 
will bear a hefty share of the machine’s cost. 
In 2016, NDRC rejected IHEP’s request for 
$100 million for R&D on the megacollider 
during China’s current 5-year plan, forcing 
the institute to scrape together funds from 
other sources. “This is a great project for 
China and the world,” Yau argues. He notes 
that 300 scientists from nine countries con- 
tributed to the preliminary design report. 

In the meantime, the flurry of activity in 
planning and completing the approved ac- 
celerator projects has left IHEP staff “over- 
loaded,” says the CSNS’s Chen Hesheng. His 
institute and others are recruiting from the 
Chinese diaspora to get the job done. Lou 
is one example. Mainland-born, he took a 
leave from a tenured position at the Uni- 
versity of Texas in Austin to work on the 
CEPC. Adequate funding is another chal- 
lenge, as construction spending squeezes 
budgets for operating the new facilities. 
The CSNS, for example, planned for five ex- 
perimental instruments, but is starting with 
three because construction costs rose more 
than anticipated. 

Still, as IHEP’s Wang Sheng puts it: “It’s 
a good time to be an accelerator physicist 
in China.”
The new China Spallation Neutron Source is one of just a handful of comparable facilities worldwide.
Dams nudge Amazon's ecosystems off-kilter


Decline of giant catfish 
emblematic of habitat 
fragmentation 


By Barbara Fraser, in Lima 


Once upon a time, thousands of dorados,
a giant among catfish, would
swim more than 3000 kilometers 
from the mouth of the Amazon River 
to spawn during the austral autumn 
in Bolivia’s Mamoré River, in the foot- 
hills of the Andes. But the dorado, which 
can grow to more than 2 meters in length, is 
disappearing from those waters, and scien- 
tists blame two hydropower dams that Brazil 
erected a decade ago on the Madeira River. 
“The dams are blocking the fish,” says
Michael Goulding, a Wildlife Conservation 
Society aquatic ecologist in Gainesville, 
Florida, who has been studying the dorado 
since the 1970s. They are “probably on their 
way to extinction” in Peru and Bolivia. 
Most Amazon dams are in Brazil, where 
scientists have raised concerns about the 
displacement of local communities and 
emissions of greenhouse gases such as car- 
bon dioxide and methane from large res- 
ervoirs. But as countries seek new energy 
sources to drive economic growth, a surge 
in dam construction on the eastern flank 
of the Andes could further threaten fish 
migration and sediment flows, Elizabeth 
Anderson, a conservation ecologist at Florida 
International University in Miami, and col- 
leagues warn this week in Science Advances. 
For the ecology of the western Amazon 
Basin, where the mountains meet the low- 
lands, the main consequence of prolifer- 
ating dams is habitat fragmentation (see 
map, p. 509). Interference with spawning is 
one facet. Another is that dams hold back 
sediments and nutrients that nourish the 
Amazon Basin, Anderson says. Her team 
documented 142 hydropower dams that are 
operating or under construction on head- 
waters in the western Amazon Basin, and 
another 160 that are under consideration. 
If even a fraction of the planned projects
is completed, the habitat disruption could
have a cascade of ecosystem effects with
devastating consequences, scientists say.

Catches of dorado are in decline as the catfish is failing to navigate two dams on its epic spawning run.

The disappearance of the dorado 
(Brachyplatystoma rousseauxii) from the 
Mamoré River suggests fragmentation is 
already taking a toll. And that’s despite 
features of the dams that are meant to 
mitigate their impact. The Madeira dams, 
for example, are designed to allow fish to 
pass: The lower dam has a bypass chan- 
nel and the upper dam has an enclosure 
in which fish are captured. They are then 
trucked upstream for release 
into the reservoir. Perhaps be- 
cause of variations in currents 
or water chemistry, dorados 
are not using the channel, says 
Carolina Rodrigues da Costa 
Doria, an ichthyologist at the 
Federal University of Rondônia
in Porto Velho, Brazil. The
threat may not be limited to 
fish: Freshwater dolphins and 
river otters may also migrate 
along Amazonian rivers, and 
how dams affect their behavior
is unknown, says Paul Van
Damme, director of the Insti- 
tute for Applied Research on 
Water Resources, a research 
center in Cochabamba, Bolivia. 
During the rainy season— 
from November to May—water 
levels rise in the Amazon Basin, 
flooding large tracts of forest. 
Various species of fish swim into 
the forest, where they feed on 
fruits—and later disperse seeds. 
By blocking migration routes 
or changing water levels, dams 
change seed dispersal patterns, 
says Sandra Bibiana Correa, 
a freshwater ecologist at Mississippi State
University in Starkville. The fish-forest pas 
de deux “is a really delicate interaction,” she
says. “It has been going on for tens of millions 
of years. We can disrupt that very easily.” 

Parting the waters

A new analysis forecasts severe habitat fragmentation in the western Amazon Basin
if some of the 160 planned dams are built in the region, where 142 dams are already
operating or under construction.

Some species are taking advantage of
the disorder. Another kind of giant catfish
known as the manitoa or piramutaba once
rarely ventured upstream of rapids that predated
the Madeira River dams. Unlike its
cousin, this species (B. vaillantii) can make it
through both dam bypasses and into the upper
reservoir, and from there swims another
1000 kilometers or so upstream to Peru’s
Madre de Dios watershed. Whether the new- 
comer will fill the dorado’s ecological role or 
prey on different fish and thus skew species 
assemblages is unclear, says Carlos Cañas, a
river ecologist at the Wildlife Conservation 
Society here, who plans to monitor the mi- 
gration of large catfish in the watershed. 
Two other threats—climate change and the 
deforestation that accompanies road building 
during dam construction—could amplify the 
severity of ecological deterioration, Anderson 
says. At stake, she says, are the livelihoods of 
indigenous peoples who depend 
on fishing. Other projects, such 
as Peru’s plans to dredge rivers 
to improve navigability, could 
exacerbate the ecological impact 
by changing flows, disturbing 
spawning sites, and disrupting
the river-forest connection,
says Fabrice Duponchelle of the 
French Institute of Research for 
Development in Marseille. 
Amazon nations should work 
together to craft a basin-wide 
management plan for migratory
fish, says Thomas Lovejoy,
a tropical ecologist at George
Mason University in Fairfax,
Virginia. “You have to manage
the Amazon as a system,” he
says. It’s not too late to preserve
much of the ecology of the western
Amazon Basin, where “we
still have a lot of free-flowing
rivers,” Anderson adds. “There
is a huge opportunity to pro- 
tect at least a subset of them by 
thinking at a regional scale.” 


Barbara Fraser is a freelance 
journalist in Lima. 
HUMAN NATURE, OBSERVED


For decades, two psychologists have kept 
watch over 1000 New Zealanders, teasing 
out factors that shape a life's course 


By Douglas Starr 


In 1987, Avshalom Caspi and Terrie
Moffitt, two postdocs in psychology, had 
adjacent displays at the poster session 
of a conference in St. Louis, Missouri. 
Caspi, generally not a forward man, 
looked over at Moffitt’s poster and was 
dazzled by her science. “You have the 
most beautiful data set,” he said. Not 
one to be easily wooed, Moffitt went to 
the university library after the meeting and 
looked up Caspi’s citations. Yep, he’d do. “It 
was very nerdy,” Caspi recalls. “We fell in love
over our data.” 

It’s been a personal and scientific love af- 
fair ever since. For nearly 30 years, Moffitt 
and Caspi have been collaborating on one 
of the more comprehensive and probing 
investigations of human development ever 
conducted. Launched in 1972, the Dunedin 
Multidisciplinary Health and Development 
Study is as fundamental to human develop- 
ment as the Framingham Heart Study is to 
cardiovascular disease and the Nurses’ 
Health Study is to women’s health. From de- 
tailed observations of the life courses of about 
1000 New Zealanders, Dunedin has spun out 
more than 1200 papers on questions from the 
risk factors for antisocial behavior and the 
biological outcomes of stress to the long-term 
effects of cannabis use. Moffitt, who joined 
the study in 1985, and Caspi, who followed, 
have led much of the work. They “have done 
so much it’s impossible to pigeonhole them,”
says Brent Roberts, a psychologist at the Uni- 
versity of Illinois in Champaign who has col- 
laborated with the now-married couple. 

One early finding, on the transient nature 
of most juvenile criminality, was cited in the 
U.S. Supreme Court’s 2005 decision to pro- 
hibit the execution of underage murderers. 
Moffitt and Caspi did pioneering research 
showing that self-control in early childhood 
predicts health and happiness in adults. They 
detailed how the genetic makeup of certain 
individuals can make them vulnerable to 
specific stresses, elucidating the complex 
interplay between genes and life experience. 
And last year, the Dunedin team published a 
study that drew on decades of data to show 
that, contrary to conventional belief, the vast 
majority of people experience mental health 
problems over the course of their lifetime. 

“Their work has transcended psycho- 
logy to influence thinking in psychiatry, 
genetics, criminology, epidemiology, socio- 
logy, and many other areas,” the American 
Psychological Association said when it 
awarded Moffitt and Caspi its 2016 Award 
for Distinguished Scientific Contribution. 

The field of psychology is rich with 
longitudinal studies, going back to 1946, 
when the United Kingdom’s Medical Re- 
search Council began a survey of more than 
5000 people from birth to old age. Other 
researchers have followed identical and
fraternal twins over time to tease out the 
influence of nature versus nurture. Such 
studies, although slow, make it possible to 
observe phenomena in real time instead of 
having to reconstruct them from subjects’ 
memories or medical records, or by com- 
paring disparate groups. 

Dunedin is the Goldilocks of longitudinal 
studies. It’s not the biggest or the longest; 
but its high retention rate—about 95% of 
the original cohort has stayed with the study 
since it launched—and the intimacy of the 
data-gathering process make the group one 
of the most closely examined populations on 
Earth. Every few years, the team conducts in- 
tensive cognitive, psychological, and health 
assessments. They interview every member 
of the research cohort as well as their teach- 
ers, families, and friends and review their 
financial and legal records, promising them 
complete confidentiality in return for the 
fullest possible picture of their lives. 

As a result, Moffitt, Caspi, and their 
colleagues have been able to tease out 
previously unseen patterns in human devel- 
opment. “It’s been a remarkable resource,” 
Columbia University psychiatrist Ezra Susser 
says of the study. “There really is no equivalent,”
Roberts adds. “Time after time they’ve
anticipated where we've needed to go and 
done the work earlier than the rest of us.” 
MOFFITT GREW UP in central North Carolina,
where her Scotch-Irish ancestors had
settled in the 1700s. She helped with farm 
chores and rambled in the woods, collect- 
ing arrowheads and catching fireflies, often 
in the company of her beloved Grandma 
Macon. “Respect is earned, not demanded,” 
Grandma Macon had taught her. “A woman’s 
beauty is in her strength and in her laugh.” 

Red haired and blue eyed, Moffitt was 
the first person in her extended family to 
attend college. Having come from a “low- 
quality high school,” she chose easy classes 
at the University of North Carolina in Chapel 
Hill, but when professors recognized her po- 
tential, she warmed to the challenge of ad- 
vanced courses in psychology and behavioral 
pharmacology. She supported herself as a lab 
tech in nearby Durham’s Research Triangle 
Park—a job she got because having grown up 
hunting rabbits and squirrels, she didn’t feel 
queasy about sacrificing lab animals. 

In graduate school at the University of 
Southern California in Los Angeles, work- 
ing with the famed psychologist Sarnoff 
Mednick, she focused on the roots of crimi- 
nal behavior. A few months before defend- 
ing her Ph.D. dissertation, Moffitt and a few 
classmates decided to relax by taking sky- 
diving lessons. It didn’t go well. Moffitt broke 
her leg in several places. Confined to a wheel- 
chair, she spent several hours entertaining a 
visitor from New Zealand who had come to
speak about his study. Phil Silva had been 
tracking all 1037 children born in the Queen 
Mary Maternity Centre at Dunedin Hospital 
from April 1972 to March 1973, examining 
them every couple of years to assess their 
cognitive and psychological development. 

Moffitt saw the population as a readymade 
laboratory to explore some of her ideas about 
early childhood and the roots of criminal- 
ity. She joined the study when the children 
were 13 years old. Not long after earning her 
Ph.D., she made her first trip to New Zea- 
land, gathering data about girls who got in 
trouble with the police. She presented her 
findings at that fateful St. Louis conference. 
Next to her was a guy with olive skin, soulful 
eyes, and brown hair in a ponytail. 

In a New Zealand testing center, Terrie
Moffitt (left) and Avshalom Caspi
(right) probe their subjects’ physical
and mental condition.

Caspi, his hair now silver but still often in a
ponytail, grew up on a tiny kibbutz in Israel’s
Negev Desert. The son of a Yemenite father
and a Lithuanian mother, he was part of the
“peopling” of the Negev envisioned by Israel’s
founding prime minister, David Ben-Gurion,
who had retired to a cottage up the street.
Every morning, Caspi and his five classmates
would walk past Ben-Gurion’s cottage
on the way to their little school. “Shalom,
David,” they’d call out to him. “Shalom yeladim
[children],” Ben-Gurion would reply.
Caspi was 10 when his family moved to
Berkeley, California, where his father completed
his Ph.D. in international
studies. “The transition from a 
kibbutz to Berkeley was pretty 
amazing,” Caspi recalls. “This 
was the ’70s, and I remember 
dodging riots with my dad.” 

His experiences had made him 
curious about people and places, 
and he thought about becoming 
a journalist. But psychology won 
him over in college, and at Cor- 
nell University he did graduate 
work with sociologist Glen Elder, 
who wrote Children of the Great 
Depression: Social Change in Life 
Experience, a landmark study of 
how the economic environment 
influences human development. 
Caspi became fascinated by the 
interplay between people's dispo- 
sitions and their social environ- 
ment. He was presenting a study 
on how ill-tempered children be- 
come ill-tempered adults when a 
young woman with crimson hair 
set up her poster next to his. 

A few years of long-distance 
courtship ensued, before the couple settled 
in at the University of Wisconsin in Madi- 
son. Colleagues warned against making 
their personal partnership a professional 
one, but they plunged ahead, and Caspi 
joined the Dunedin team. 

His first collaboration with Moffitt in- 
volved a question that had long intrigued 
him: Do dramatic life events change people, 
or simply cement who they already are? The 
team focused on female puberty. Just over 
half of the Dunedin study subjects were girls, 
who by age 15 had been given physical and 
psychological exams since childhood. Previ- 
ous studies showed that reaching puberty 
early is especially stressful, and Caspi and 
Moffitt found that the girls who had the 
most trouble adjusting to early adolescence 
were those who had shown behavioral prob- 
lems in early childhood. That contributed 
to Caspi’s “accentuation” hypothesis—that 
stressful transitions tend to accentuate who 
we basically already are. It was a radical no- 
tion in the early 1990s, when psychologists 
generally believed that situation, not temper- 
ament, was key to determining personality. 


A lifetime of discoveries

For decades, Terrie Moffitt, Avshalom Caspi, and other researchers have used a variety of tools to monitor a cohort of about 1000 New Zealanders. Multiple findings
show that the course of their lives can largely be traced to their health, environment, and temperament as children.

Key findings
1 Temperament: Girls who have behavioral problems as children have more trouble adjusting to early puberty.
2 Personality continuities: Undercontrolled 3-year-olds tend to grow up impulsive and antisocial; inhibited 3-year-olds tend to become unassertive and depressed.

Study milestones
1 Perinatal data recorded for babies born in Dunedin Hospital's Queen Mary Maternity Centre.
2 Psychologist Phil Silva launches study, enrolling and following 1037 3-year-olds.

Years marked on the timeline: 1972-3, 1975, 1985

SINCE THEN, their work has often explored
the darker side of human nature. Contrary to
what the idyllic landscapes in the Lord of the
Rings films might suggest, New Zealand is
not heaven on Earth. Economic inequality is
comparable to that of the United States, and 
addiction, suicide, and assault rates are simi- 
lar. By promising strict confidentiality, the re- 
search team gets shockingly frank confessions 
from their subjects, including about brutality 
and criminal behavior. (A startling example: 
Women admitted to physically abusing their 
spouses as often as men, although they gen- 
erally inflicted less physical damage.) “We 
have a policy of never interfering and never 
ratting them out,” Moffitt says. “And the New 
Zealand police, who understand the value of 
our research, have never asked.” 

This long, intimate surveillance enabled 
Moffitt to track a troubled subset of the 
Dunedin cohort. Delinquent behavior is 
known to peak between the late teens and 
mid-20s, mostly in men. By 15, about a third 
of the study’s boys took part in some degree 
of delinquency, Moffitt found, while a subset 
offended more frequently. When she looked 
at data from early childhood, she found that 
the habitual offenders had been making 
trouble from age 3, and had arrest records 
starting before their teen years. 

Over the years, Moffitt reported in a se- 
ries of papers that these boys did poorly in 
neuropsychological tests (such as verbal 
skills and verbal memory), measured high for 
impulsivity, and were likely to engage in sub- 
stance abuse as they grew older. In 2002, she 
reported that at age 26 this same group was 
committing most of the crimes in the com- 
munity—a pattern that persisted well into 
their 30s. In short, whereas many boys ex- 
hibited “adolescent-limited” criminal behav- 
ior, about 5% were “life-course-persistent” 
offenders. The work had important implica- 
tions for social work and law, and won Moffitt 
the 2007 Stockholm Prize in Criminology. 

In 2000, after Silva retired, Richie Poulton, 
a psychologist at the University of Otago 
in Dunedin, became project director, with 
Moffitt continuing as associate director. Every
couple of years Moffitt and Caspi would
visit Dunedin to spend a few months with
Poulton collecting information, then return
home to analyze data. (A few years earlier
they had also initiated a study of 1100 U.K.
families with twins that added prodigiously
to their Dunedin data.)

The testing schedule in New Zealand, limited at first, expanded as they added new techniques and technologies, such as DNA analysis, retinal imaging (which can help gauge the brain’s vascular health), and scans of brain activity. The study leaders trained staff to interview participants, their families, and associates, and they employed data managers to handle the masses of coded data. And they maintained a high retention rate by assiduously keeping track of the participants, a quarter of whom have emigrated from New Zealand, financing travel back to the study site, and visiting study members who were in prison or too sick to travel.

Data from the Dunedin subjects’ childhood years make it possible to trace early influences on their later lives.

Key findings
3 Violence and gender: Women are as likely as men to commit intimate partner violence.
4 Schizophrenia: Hallucinations in childhood predict an increased risk of adult schizophrenia.
5 Persistent offenders: People who are violent and antisocial as children grow into violent adults more often than those whose antisocial behavior is limited to adolescence.
6 Genes and environment: Among maltreated children, those who produce low levels of the enzyme monoamine oxidase A are more likely to become violent adults.

Study milestones
3 Moffitt joins study; neuropsychological testing added. Permission to collect police records secured.
4 Caspi joins study, adds personality measures. As cohort members turn 18, questions on sexual practices are added.
5 Questions about work, finances, and intimate partner violence are added.
6 DNA collection added and biobank is established.

Timeline year shown: 1990


“They have gotten to know these people in- 
timately, watched them age, get married, have 
children, and encounter the things that hap- 
pen in middle life,” says James Tabery, a phi- 
losopher at the University of Utah in Salt Lake 
City who has written about the study. It can be 
harrowing, he says. “What do you do if one of 
them is on the verge of homelessness? Or if one 
of them is $1000 short on rent? It’s like David 
Attenborough watching a giraffe get eaten but 
it’s a giraffe you’ve come to know for 40 years.” 
Researchers can unburden themselves in a 
safe room after listening to upsetting infor- 
mation. Outside of that room, anyone who 
violates confidentiality will be fired. 


THE STUDY’S FINDINGS mostly point to pat- 
terns, not mechanisms. But in the mid- 
1990s, Moffitt and Caspi joined a group at 
King’s College London looking for genetic 
roots of behavior. The result was what may 
be the highest profile and most controversial 
of their claims. 

It’s commonly known that children who 
are abused often become violent adults. In 
the Dunedin group, for example, about half 
the boys with abusive childhoods grew into 
men prone to committing crimes. Moffitt and 
Caspi thought that studying this group could 
shed light on the classic nature versus nur- 
ture question: Do experiences alone produce 
a life of crime, or are some individuals natu- 
rally prone to it? The idea had come to them 
on safari in Namibia, where they noted how 
even in that mosquito-infested region many 
people did not get malaria, undoubtedly be- 
cause of a genetic resistance. Could genotype 
play a role in behavior as well, making certain 
children psychologically resistant to abuse? 

Earlier research had linked aggression to low levels of an enzyme called monoamine oxidase A (MAO-A), which breaks down certain neurotransmitters in the brain. Laboratory mice bred without the enzyme showed aggressive behavior, and in a famous human case, several male members of a Dutch family who were pathologically violent lacked the gene that encoded for the enzyme.

Key findings
7 Stress and illness: People who were abused as children have higher markers of inflammation, indicating an elevated risk of cardiovascular disease.
8 Self-control: Childhood self-control predicts physical health, financial success, and a lack of criminal behavior among adults.

Study milestones
7 Directorship changes from Silva to Richie Poulton.
8 Study members’ relatives are asked about family history of physical and mental illnesses.

Moffitt and Caspi analyzed their study 
members’ DNA for the gene. They found 
that among maltreated children, those who 
were genetically prone to make low levels 
of MAO-A were far more likely than their 
counterparts to become violent adults. It was 
the first substantial evidence that a specific 
gene could modulate the effect of a known 
cause of criminal behavior. 

The finding, published in 2002 in Science, 
caused a media sensation, but some research- 
ers questioned the utility of such studies. 
“Sure there’s a genetic effect in people who
have been exposed to violence,” says Dean
Hamer, scientist emeritus at the National 
Institutes of Health in Bethesda, Maryland, 
who has written widely on genes and be- 
havior. “But the effects are so small and so 
variable” as to be dwarfed by the dangers of 
childhood abuse. 

Since then, investigators have moved away from
individual genes. “We’re long past the low- 
hanging fruit of Mendelian genetics,” Tabery 
says. Geneticists now survey the genomes 
of thousands of people with a particular 
condition—asthma, for example—in search 
of gene/DNA variants whose small effects 
on risk combine. Often it turns out that hun- 
dreds or even thousands of genes contribute 
to the overall risk, not just one. Moffitt and 
Caspi have adopted that approach to search 
for genetic factors in conditions including 
tobacco addiction and obesity. 

Recently, Caspi compared longitudinal 
research to farming—“sowing, nurturing, 
waiting, and harvesting.” It’s an apt com- 
parison, given that his and Moffitt’s grand- 
parents were farmers. Year after year they 
have harvested data, and new revelations 
about human development have emerged. 

Key findings
9 Cannabis: Long-term heavy cannabis use that begins in adolescence is linked to cognitive decline in adulthood.
10 Early aging: Multiple indicators reveal signs of rapid physical and cognitive aging in some adults in their 30s.

Study milestones
9 Tests of aging introduced, including grip strength, balance, and retinal imaging, which reveals vascular health of brain.
10 Study searches nationwide electronic administrative data, such as medical and social welfare records.

Timeline year shown: 2010

In 1987, Silva was among the first to rec- 
ognize that children can experience hal- 
lucinations, when he detected them while 
interviewing 11-year-olds. As the subjects
aged and the study accumulated data, an 
ominous connection emerged. Follow-ups 
at age 26 revealed that the kids with hal- 
lucinations had developed full-on schizo- 
phrenia at more than 25 times the rate 
of the general population. (Schizophrenia 
is so rare that the sample numbers were 
small, however.) 

The work, reported in 2013, added sup- 
port to the emerging practice of “prodromal” 
psychiatry, in which symptoms are identi- 
fied and treated in their earliest stages, years 
before they blossom into disorders (Science, 
17 November 2017, p. 856). “I’ve had moth- 
ers write me letters all the time saying, ‘My 
kid has some pretty crazy ideas, where can I 
go?’” says Moffitt, who describes the work as 
the most gratifying of her career. Now, she 
can direct them to clinics for treatment. 

“It took a quarter of a century to see this
unfold,” Caspi says. “You just have to be pa- 
tient. Longitudinal research is a big exercise 
in delayed gratification.” 


WHEN MOFFITT AND CASPI came back to the 
United States in 2007 to accept professor- 
ships at Duke University in Durham, it was 
a chance for Moffitt to return to her origins. 
They bought an old family property in central 
North Carolina, removed tons of old tires and 
debris, rebuilt structures, cleared trees, and 
built a new farmhouse. 

Origins mean a lot in their work, which 
has shown that our nature as adults—our 
capacity for self-control, our propensity for 
violence—has deep roots in the children we 
once were. Recently, Moffitt and Caspi took 
a broad look at how their study members, 
now 45, are faring. It’s a poignant time of 
life, when middle-aged adults face the reality
that many of their youthful dreams won’t
come true. The Dunedin data showed that
just over a fifth of the population accounts
for the bulk of the social costs: crime, welfare
payments, hospitalizations, cigarette purchases,
fatherless child-rearing, and other
indicators of social dysfunction.

Key findings
11 Social ills: Twenty-two percent of the cohort accounted for the bulk of welfare, medical, and insurance costs; fatherless child-rearing; and crime. Many had poor brain health at age 3.
12 Mental health: Dunedin and other studies show that most people have at least one episode of mental illness during their lifetime.

Study milestones
11 Study adds brain scans, dental imagery, bone density scans, and assessments of pain, hearing, vision, and gait.
12 Family members interviewed and records searched to document family history of dementia.

What’s wrong with these people? Moffitt 
and Caspi went back and looked at their 
data from age 3. The target group seemed 
cursed from the beginning: They scored 
low on early language skills, fine and gross 
motor skills, neurological health, and self- 
control. Often they also grew up in poverty 
and suffered maltreatment. All through life 
their disadvantages haunted them. “They 
didn’t get a fair start right out of the starting 
block,” Moffitt says. “You can’t expect people 
with this kind of childhood to do well.” 

They even seem to age faster than those 
who had a better start. In their cohort, 
Moffitt and Caspi have been finding signs 
of aging starting in the 30s. They’re par- 
ticularly struck by the effects of stress at an 
early age. Childhood abuse seems to erode 
telomeres—the caps at the end of chromo- 
somes, associated with cell preservation— 
and that, in turn, may accelerate aging. 

Moffitt and Caspi offer no grand unified 
theory of human development: Humans are 
too complicated, too irrational, to sum up in 
a principle. What their research gives them 
is not so much a conclusion about humans 
as a particular point of view. 

“All people are not created equal,” Moffitt 
says. “Some have real gifts and talents, and 
some have real problems right out of the 
starting block. Once we accept that, we can’t 
dodge the responsibility for social action.” 

Watching people’s lives unfold over de- 
cades, she adds, “obliges compassion.” 


Douglas Starr is co-director of the program 
in science journalism at Boston University. 

Out of balance in the Arctic


Polar bears have high energy requirements that rise 
further as a result of climate change 


By John P. Whiteman 


As human activities lead to rising
greenhouse gas concentrations in 
Earth’s atmosphere, less incoming 
solar energy is released back into 
space, causing a net energy gain that 
increases global temperatures. The 
consequence of climate change for polar 
bears can likewise be understood in terms 
of an energy imbalance. Sea ice melting re- 
duces the opportunities for polar bears to 
capture seals (see the photo), leaving them 
at risk of expending more energy in the pur- 
suit of food than they can obtain. The mag- 
nitude of this imbalance is determined by 
their rate of energy use. On page 568 of this
issue, Pagano et al. (1) quantify the energy 
expense of wild polar bears and show that it 
is higher than previously estimated. 
Animal energy use is typically described 
by two terms. The resting metabolic rate 
(RMR) includes the energy cost of basic or- 
ganismal functions, such as blood circula- 
tion and breathing. The field metabolic rate 
(FMR) includes RMR as well as the energy 
cost of additional activities, such as move- 
ment and foraging. Because energy bal- 
ance influences whether an animal survives 
and reproduces, RMR and FMR are critical 
variables in ecology and conservation. But 
they are notoriously difficult to quantify for
large, mobile animals, particularly in remote
habitats.

Polar bears look for food onshore in East Greenland, August 2016. The species needs sea ice as a platform to capture energy-rich prey, particularly seals. Polar bear population trends currently vary across the Arctic, but scientists expect broad declines if sea ice loss continues.

Scientists have previously estimated po- 
lar bear energy requirements by calculat- 
ing RMR with predictive body mass-based 
equations derived from other species and 
then multiplying RMR by a constant to yield 
FMR (2). In another study, a researcher ex- 
trapolated field observations of seal kill rates 
to estimate the energy intake and FMR of 
polar bears (3). These and other studies as- 
sumed that polar bears can spontaneously 
reduce their RMR during food deprivation, 
which would be beneficial while searching 
for sparsely distributed seals. A recent study,
however, found that polar bear energy expense
does not substantially decline during
fasting (4), leaving the magnitudes of both
their RMR and FMR uncertain.

Department of Biology, University of New Mexico, Albuquerque, NM 87131, USA. Email: jwhiteman@unm.edu

In their study, Pagano et al. measured rates of oxygen consumption and carbon dioxide production in polar bears; these variables can be converted to energy use. First, the researchers assessed RMR by training a zoo polar bear to sit quietly in a chamber while its oxygen consumption was measured. Next, they quantified FMR in nine free-ranging bears on sea ice near the northern coast of Alaska, USA, by using doubly labeled water. To do so, the authors first captured the bears and injected them with water labeled with heavy stable isotopes (²H₂¹⁸O); they also collected a baseline blood sample. About 10 days later, they recaptured the bears and took a second blood sample. In the interim, the injected ²H left the body in water molecules (for example, through urination), whereas the ¹⁸O left the body in both water and exhaled CO₂ because oxygen atoms readily exchange between H₂O and CO₂ in the blood. As a result, the difference in decline of ¹⁸O and ²H between sampling events represents the CO₂ production rate, which in turn reflects FMR.

FMRs of the free-ranging bears were on average 1.6 times higher than previous estimates (see the figure) but were consistent with expectations for mammals that mainly consume other vertebrates; these mammals expend more energy than those consuming vegetation or mixed diets (5). Similarly, the zoo bear had a relatively high RMR, matching expectations for a carnivore. Although the mechanism for the high energy cost associated with meat-eating is unclear (6), these findings emphasize that polar bears have evolved as hunters, particularly of fat-rich marine mammal prey. This distinguishes them from other bear species, which include herbivores, insectivores, and omnivores. Polar bear activity patterns are also different from those of other bears: On average, the individuals sampled by Pagano et al. were active for 34% of their time, making them more similar to large terrestrial carnivores (39%) than to other bear species (50 to 60%) (7).

Figure: Energy balance of adult female polar bears
A 175-kg polar bear needs far more energy than a herbivorous giant panda (Ailuropoda melanoleuca), scaled to the same body mass. Polar bears therefore require access to high-energy food sources, particularly pagophilic (ice-loving) seals.
Bar labels: Giant pandas; Polar bear; Previous estimate (2); Semicaptive (14); Wild (15). Legend: Field metabolic rate; Resting metabolic rate.
Primary prey: Bearded seal (Erignathus barbatus), ~3700 MJ; Ringed seal (Pusa hispida), ~600 MJ.

The high energy requirements of polar bears corroborate previous hypotheses that most terrestrial Arctic habitats, lacking prey as energy-rich as marine mammals, cannot provide enough food for polar bears driven to shore by loss of sea ice (8). In addition, Pagano et al.’s results demonstrate that polar bears do not reduce their metabolic rate when food is scarce (4). Four of the nine bears lost ≥10% of their body mass during the days between blood sampling events; at least one of these bears lost substantial lean tissue (structural proteins such as muscle) as well as fat (the primary form of energy storage). These bears nevertheless had high FMRs.

Pagano et al.’s findings refine our understanding of how polar bears are affected by environmental change in the Arctic. Longer periods of ice melt are lengthening the time during which the bears must endure food deprivation in summer. Molnar et al. have previously used anatomical models of energy storage and body mass change over time to estimate that starvation mortality of adult male bears would increase from 6 to 48% if summer fasting were extended from 120 days to 180 days (9). Such models can now be modified to include empirical FMR estimates. Further, thinner sea ice drifts faster, and polar bears have had to increase their movement rates in order to remain in their preferred habitat, incurring higher energy costs (10). These costs can now be modeled more precisely because of the strong linear relationship between movement rate and FMR reported by Pagano et al.

The data also help to resolve why the population-level response of polar bears to sea ice loss varies by region. For example, two Alaskan subpopulations of polar bears have recently experienced substantial ice loss, but only one of them is declining thus far (11). High and relatively inflexible RMRs and FMRs for polar bears suggest that population responses are mainly driven by local, ecological factors, such as primary productivity or overlap of seal distribution with remaining sea ice habitat. Ice loss, if unabated, will eventually cause the extinction of polar bears in the wild (12), but continued research is needed to understand the climate-related pressures that polar bears face. Such data-driven explanations of the biological consequences of climate change are critical for public understanding and action (13).

Important questions remain regarding the energy balance of polar bears. Pagano et al. captured free-ranging individuals during spring, when new seal pups and attentive adult seals are most vulnerable to predation. During winter, polar bear activity declines, and some individuals retreat into “shelter dens” for days to weeks at a time (excluding pregnant females, which hibernate). It is unknown whether there is a concurrent seasonal reduction in FMR. Energy expense in late winter is critical because body condition reaches an annual low in this period, making it the most likely time for climate change-related mortality. In addition, Pagano et al. sampled bears without dependent cubs, leaving the cost of rearing young (such as lactation) to be quantified.
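
The doubly labeled water logic described in this article can be made concrete with a small calculation. The following is a minimal sketch that assumes the simplified textbook relation rCO₂ = (N/2)(k_O − k_H), where N is the body water pool and k_O and k_H are the fractional elimination rates of ¹⁸O and ²H; it ignores isotope fractionation corrections and is not the calculation used by Pagano et al. The function names, the body water pool, the enrichment values, and the energy equivalent of CO₂ are all hypothetical.

```python
import math


def elimination_rate(enrichment_start, enrichment_end, days):
    """Fractional elimination rate (per day) from two isotope enrichments
    measured above background, assuming exponential washout."""
    return math.log(enrichment_start / enrichment_end) / days


def co2_production_mol_per_day(body_water_mol, k_oxygen, k_hydrogen):
    """Simplified relation: 18O leaves as water and CO2, 2H only as water.
    Each CO2 molecule carries two oxygen atoms, hence the factor 1/2."""
    return body_water_mol / 2.0 * (k_oxygen - k_hydrogen)


def energy_mj_per_day(co2_mol_per_day, kj_per_mol_co2):
    """Convert CO2 production to energy use with an assumed,
    diet-dependent energy equivalent (kJ per mol CO2)."""
    return co2_mol_per_day * kj_per_mol_co2 / 1000.0


# Hypothetical inputs: enrichments at first and second blood sample, 10 days apart.
k_o = elimination_rate(150.0, 60.0, 10.0)   # 18O declines faster
k_h = elimination_rate(150.0, 80.0, 10.0)   # 2H declines more slowly
r_co2 = co2_production_mol_per_day(5500.0, k_o, k_h)  # assumed body water pool, mol
print(f"CO2 production: {r_co2:.1f} mol/day")
print(f"Energy use: {energy_mj_per_day(r_co2, 500.0):.1f} MJ/day")  # assumed kJ/mol
```

The key design point is that only the difference between the two washout rates matters: water loss affects both isotopes equally and cancels out, leaving the oxygen lost as CO₂.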

IMMUNOLOGY


Enhancing responses to cancer immunotherapy


The capacity of antigen presentation influences responses to checkpoint immunotherapy 


By Pia Kvistborg and
Jonathan W. Yewdell


Immune checkpoint blocking therapies
(ICBs) that target T cell inhibitory receptors
(immune checkpoints) have been
implemented in the clinic to treat a variety
of malignancies. To exemplify the
potential success of these therapies, the
3-year overall survival for advanced melanoma
has increased from 12% before 2010,
when standard of care was chemotherapy,
to ~60% using ICBs (1). However, ICBs fail
many patients (10 to 60% of treated patients 
respond, depending on cancer type), raising 
the obvious question of why. On page 582 of 
this issue, Chowell et al. (2) report that the 
success of ICBs is remarkably dependent on 
the ability to present diverse tumor anti- 
gens to T cells. 

Human leukocyte antigen (HLA) class I 
molecules present short intracellular 
peptides (antigens) on the surface of 
all nucleated cells and play a key role 
in CD8⁺ T cell immunosurveillance
of viruses and other intracellular
pathogens. On recognizing foreign
peptides presented by HLA class I
molecules, CD8⁺ T cells are activated.
Humans possess three HLA class
I loci: HLA-A, HLA-B, and HLA-C.
Each locus is among the most diverse 
in humans, each having thousands 
of different alleles encoding varying 
HLA class I molecules (allomorphs) 
(3). Within a population of common 
geographic origin, certain alleles are 
common, potentially decreasing the 
diversity of HLA allomorphs. Each al- 
lomorph presents a distinct, though 
often overlapping, repertoire of anti- 
gens, termed the immunopeptidome. 

Immunosurveillance of cancer cells is restricted by central T cell tolerance, a process that removes self-reactive clones. Cancer cells, however, often express “neoantigens,” mainly resulting from peptide fragments of mutated proteins that arise from DNA damage, which is commonly involved in carcinogenesis. Indeed, neoantigens are thought to be important for responses to ICBs (4). However, the impact of expressing different HLA allomorphs on response to ICBs has not been appreciated.

Studying more than 1500 cancer patients, Chowell et al. show a significant dependence of ICB responsiveness on HLA-A, HLA-B, and HLA-C heterozygosity. This trend is enhanced by (but not dependent on) the mutation load of the cells within the tumor, determined through exome sequencing of resected tumors. ICB efficacy was diminished by loss of HLA heterozygosity, presumably the result of immune selection for tumor cells with decreased expression of the cognate allomorph (see the figure). This builds on previous findings that ~40% of lung cancers display somatic loss of one or more HLA alleles (5), which results in decreased antigen presentation. This is particularly important if there is somatic loss of an HLA allele by which the majority of antitumor T cells are activated.

The HLA-A and HLA-B genes present in the cohort were divided into six supergene families based on the immunopeptidomes they present (6). Further implicating a critical role of HLA class I in ICB responses, Chowell et al. found a strongly positive effect on survival of HLA-B44 supertype alleles in melanoma patients receiving ICBs, but no association with survival in patients not receiving ICBs. Intriguingly, the correlation did not extend to ICBs of lung cancer, which may reflect the low number of lung cancer patients in the study. Chowell et al. also found that HLA-B62 supertype expression was associated with decreased effectiveness of ICBs in melanoma.

These findings affect the selection of patients who are likely to benefit from ICBs, which have potentially life-threatening side effects, and are sufficiently costly to strain national health care budgets. More generally, they provide important clues about the mechanisms of antitumor immunity. Tumors must walk a tightrope in escaping CD8⁺ T cells: loss of HLA class I molecules triggers natural killer (NK) cells, which also play an important role in tumor control. Indeed, this may be key to the observation by Chowell et al. that HLA-C alleles had the strongest positive effect on ICB responses because HLA-C molecules play a critical role in activating NK cell inhibitory receptors and present a more limited immunopeptidome. Chowell et al. propose that the generally higher complexity of the HLA-B44 supertype-associated immunopeptidome increases the number of cancer neoantigens presented. Conversely, they propose that a more rigid structure of the HLA-B62 molecule somehow reduces the capacity of T cell receptors to recognize neoantigens. This would not, however, explain the negative correlation of HLA-B62 with survival, which implies interference with antitumor responses.

Figure: Tumor cell killing by T cells
T cell activation and tumor cell killing depend on multiple factors involving antigen presentation and T cell activation.
Panel text: For antitumor T cell responses to be activated, HLA molecules presenting tumor neoantigens need to be present. Even if T cells are reactive to HLA-
Label: Immune checkpoint

Too little is known about the effects of 
polymorphism on HLA function to have 
any confidence in interpreting these im- 
portant observations. It is likely that only 
a minor fraction of potentially immuno- 
genic neoantigens have been identified 
by traditional exome sequencing, which 
ignores peptides generated from nonca- 
nonical translation products. These include 
CUG-initiated proteins, small frameshifted 
proteins, proteins from “untranslated” read- 
ing frames, and intron-encoded proteins (8). 
Given the plethora of abnormalities in cancer 
cells, noncanonical translation is likely to in- 
crease. This might occur in parallel with mu- 
tation load, contributing to the correlation 
with ICB responsiveness. 

Additionally, HLA class I allomorphs ex- 
hibit astonishing differences, including 10- 
fold or more variation in their export rate 
from the endoplasmic reticulum (9), where 
they are loaded with peptides. Furthermore, 
little is known about how allelism affects the 
pathways used by dendritic cells and macro- 
phages to acquire tumor antigens from, for 
example, dying tumor cells. This can influ- 
ence the quality of both T cell activation and 
antitumor immunity (10).

There are large gaps in our understanding 
of the astonishing diversity of the HLA mole- 
cules (HLA class II molecules, which present 
extracellular peptides, are similarly polymor- 
phic). As HLA molecules are involved in all 
aspects of immunity and autoimmunity, as 
well as neuronal development, and likely 
other biological phenomena, the findings of 
Chowell et al. encourage increasing under- 
standing of the impact of HLA polymorphism 
on vertebrate biology and human health.

The bacteria, D. acetivorans, that carry out the roTCA cycle are found in volcanic caldera in Kamchatka, Russia.


MICROBIOLOGY 


Stealth reactions driving carbon fixation


New twists to bacterial metabolic pathways that 
contribute to the global carbon cycle 


By Stephen W. Ragsdale 


Organisms live in an interconnected
dynamic web in which they make,
degrade, and interconvert compounds
containing carbon, hydrogen,
nitrogen, oxygen, phosphorus,
and sulfur. The carbon cycle involves
the oxidation of organic compounds to produce
CO₂ by heterotrophic organisms and
the incorporation (“fixation”) of CO₂ from
the environment into living tissue by autotrophic
organisms. Heterotrophic organisms
(most animals) obtain the energy for
life by conserving the energy obtained by
oxidizing organic molecules to CO₂ in the
form of reducing equivalents (electrons)
and adenosine triphosphate (ATP). Autotrophic
plants, bacteria, and archaea fix
CO₂ by a process in which the energy of
electrons and ATP is used to produce biomolecules,
such as sugars, amino acids, and
lipids, thereby replenishing these essential
organic molecules in the ecosystem. Cumulatively,
autotrophy occurs on the huge
scale of 7 × 10¹⁶ g of carbon fixed annually
(1). Six CO₂ fixation pathways differing in
their ATP requirements are known to exist.
Fixing CO₂ using the least ATP possible is
key for anaerobes because their metabolism 
generates much less ATP than does growth 
on oxygen. The reductive tricarboxylic acid 
(rTCA) cycle is one of the most evolution- 
arily ancient and least ATP-demanding au- 
totrophic pathways. On pages 563 and 559 
of this issue, Mall et al. (2) and Nunoura et 
al. (3), respectively, uncover an unexpected 
ATP-conserving mechanism, and Pachia- 
daki et al. (4) report a surprising source of 
reducing equivalents in the rTCA cycle. 
According to our understanding of the 
rTCA cycle, three enzymes that catalyze reac- 
tions in the oxidative TCA (oTCA) cycle that 
would be highly thermodynamically unfavor- 
able if performed in the reverse direction are 
substituted by parallel energetically favorable 
reactions (5) (see the figure). In the rTCA 
cycle, citrate synthase (CS), which catalyzes 
conversion of oxaloacetate and acetyl-CoA 
(coenzyme A) to citrate and CoA, the most 
irreversible reaction in the oTCA cycle, is re- 
placed by ATP-dependent citrate lyase (ACL) 
or homologous enzymes that catalyze this 
reaction in two steps (5, 6). The existence 
of ACL is the typical indicator for an opera- 
tional rTCA cycle in an organism (2). The use 
of ATP in the ACL reaction converts a mountainously thermodynamically
unfavorable reaction into a slightly uphill one. Thus, the
direct physiological reversal of CS has been 
thought to be impossible. 

It is time to change these views of CS and 
of the rTCA cycle. Mall et al. and Nunoura et 
al. showed that two thermophilic sulfur-reducing anaerobic bacteria,
Desulfurella acetivorans and Thermosulfidibacter takaii, lack the gene
encoding ACL, but retain all other rTCA cycle enzymes. Both organisms
also lack the genes required for other known CO2 fixation pathways.
Surprisingly,
despite the high thermodynamic barrier, 
they found that CS functioned in 
reverse. To distinguish this path- 
way from the typical rTCA cycle, 
this was termed the reversed oxi- 
dative TCA (roTCA) cycle. 

How do these organisms over- 
come the huge thermodynamic 
barrier for reversing CS? Appar- 
ently, they rapidly and efficiently 
couple roCS with the succeed- 
ing reaction(s) to keep the cel- 
lular roCS substrate (citrate and 
CoA)/product (oxaloacetate and 
acetyl-CoA) ratio extremely high. 
Mall et al. detected very high activities of both roCS and reverse
oxidative malate dehydrogenase (roMDH), which would promote rapid
conversion of oxaloacetate (from roCS activity) to malate.
However, enhancing the activity 
of these enzymes will increase the 
rates of both oxidative and reverse 
reactions. The key is flux. These 
organisms must efficiently convert 
the roCS and roMDH products 
malate and acetyl-CoA into cellu- 
lar biomolecules, thus maintaining very low 
concentrations of oxaloacetate, effectively 
pulling flux through the roTCA cycle. Even without the ATP hydrolysis
that drives the ACL-catalyzed reaction in the rTCA cycle, the overall
roTCA cycle is thermodynamically favorable;
thus, another trick may involve bifurcation, 
driving the unfavorable half of the cycle by 
coupling it to the favorable half, yielding a 
net spontaneous process. Nunoura et al. sug- 
gest that abundant input of redox equiva- 
lents via electron bifurcation may provide 
the needed thermodynamic push. However, 
roCS is not an electron transfer reaction, so 
it would require an unknown mechanism 
to make such a bifurcation work. To add to 
the mystery, the CSs of both organisms have 
rather typical kinetic parameters and are not 
specifically adapted to run in reverse. 
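
The flux argument above can be made concrete with the standard relation between the actual free-energy change and the mass-action ratio, ΔG = ΔG°' + RT ln Q. The following is a minimal sketch, not a calculation from the papers discussed: the standard free energy of citrate cleavage (about +35 kJ/mol) and the growth temperature (55°C) are assumed, illustrative values, and the function names are hypothetical.

```python
import math

R = 8.314e-3  # gas constant in kJ/(mol*K)


def delta_g(dg0_kj, temp_k, q):
    """Free-energy change (kJ/mol) at mass-action ratio q = [products]/[substrates]."""
    return dg0_kj + R * temp_k * math.log(q)


def max_ratio_for_spontaneity(dg0_kj, temp_k):
    """Largest product/substrate ratio at which delta G is still <= 0."""
    return math.exp(-dg0_kj / (R * temp_k))


# Assumed, illustrative inputs (not taken from the article):
DG0_CITRATE_CLEAVAGE = 35.0  # kJ/mol, rough value for CS running in reverse
TEMP = 328.15                # K, i.e. 55 degrees C

q_max = max_ratio_for_spontaneity(DG0_CITRATE_CLEAVAGE, TEMP)
print(f"product/substrate ratio must stay below about {q_max:.1e}")
print(f"delta G at ratio 1e-7: {delta_g(DG0_CITRATE_CLEAVAGE, TEMP, 1e-7):.1f} kJ/mol")
```

Under these assumptions the product/substrate ratio has to be held at a few parts per million or lower, which is why rapid removal of oxaloacetate and acetyl-CoA matters more than the kinetic parameters of CS itself.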

The essential point is that reversing CS 
spares one ATP for the organism during 
autotrophic growth, which is potentially 
important as ATP is of premium value for 
anaerobes. Thus, these fastidious microbes require a single ATP to fix
two molecules of CO2, matching the stoichiometry, efficiency, and free
energy (a measure of thermodynamics) of the Wood-Ljungdahl pathway (6).
By contrast, the prevalent CO2-fixing pathway, the Calvin cycle, uses
seven ATPs. Not surprisingly, the Calvin cycle has not been found to
occur in anaerobes. In aerobes, O2 is used
to generate an ample supply of ATP. Further- 
more, because ACL is the signature enzyme 
for detecting the rTCA cycle, the roTCA cycle 
could be a widespread mode of CO2 fixation,
previously hidden below our genomic radar. 


The autotrophic roTCA cycle
In the rTCA cycle, two CO2 molecules are fixed to generate acetyl-CoA,
which is used for biosynthesis. Because three reactions in the oTCA cycle are
irreversible, they are substituted by ATP-dependent enzymes, such as ACL,
in the rTCA cycle. Surprisingly, in the roTCA cycle, citrate synthase (CS) is
retained. The discovery of nitrite oxidation as a source of electrons in the
rTCA cycle is another unexpected turn of events in carbon fixation.

Mall et al. asked if there are other examples
of unexpected reactions running in reverse. 
Anaerobic methane oxidation is one example. 
This is an unusual pathway that is important 
to the carbon cycle in which the entire reaction
sequence of CO2 reduction to methane
runs in reverse (7). This appears to occur in a 
microbial community by coupling the reduc- 
ing equivalents generated by the methane 
oxidizer to the highly favorable reduction of 
electron acceptors—for example, sulfate—by 
a sulfate-reducing organism (8). 

A mode of CO2 fixation by rTCA based on nitrite oxidation has also been
lurking unnoticed in the dark ocean where light is too dim to produce ATP
using photosynthesis and fix CO2 by the Calvin cycle. Using
metagenomic analyses of nitrite-oxidizing 
bacteria, Pachiadaki et al. identified a 
highly represented phylum, Nitrospinae, 
which contains genes encoding nitrite 
oxidoreductase and rTCA cycle enzymes, yet 
lacks the genes for other energy-generating 
pathways such as ammonium or sulfur oxidation. Thus, CO2 fixation in the
rTCA cycle
by these bacteria appears to rely on nitrite 
oxidation as a source of electrons. The au- 
thors found that nitrite oxidation appears 
to be responsible for up to 43% of the to- 
tal carbon fixed at some ocean depths. This 
process has thus far been underappreciated 
because nitrite is found at very low concen- 
trations in the ocean. Correspondingly, the 
nitrite oxidoreductases in these bacteria 
have very high affinity for their substrates. 

Coming back full circle to the roTCA 
cycle, the genome of the most frequently 
encountered marine nitrite oxi- 
dizer contains coding sequences 
with similarity to those of known 
ACLs (9), making it likely that 
Nitrospinae use the rTCA cycle 
to fix carbon, using nitrite as an 
electron source. However, given 
the precedent described by Mall 
et al. and Nunoura et al., the use 
of roCS in this phylum of bacte- 
ria cannot be excluded. 

It is exciting to alter our con- 
ceptions about the rTCA cycle, a 
key pathway in the global carbon 
cycle. rTCA has been studied for 
50 years. This pathway and the 
Wood-Ljungdahl pathway (6) 
have the lowest energetic costs 
and are considered to be the most 
evolutionarily ancient modes 
of CO2 fixation. The new findings highlighted here reveal an
unusual mechanism (roCS) that 
microbes use to lower the ATP 
requirement for the roTCA cycle 
and the use of a surprising source 
of electrons (nitrite) to drive the 
rTCA cycle. Future studies could reveal the 
details of how such efficient flux through 
the roCS reaction is accomplished. In ad- 
dition, we can look forward to results that 
clearly quantify the relative global contributions of the various
pathways and electron sources that drive CO2 fixation and to
the discovery of other autotrophic metabolic systems.

SUPERCONDUCTIVITY


Lighting up superconducting stripes 


Terahertz radiation reveals pair density waves in underdoped cuprate superconductors 


By Emre Ergecen and Nuh Gedik 


Cuprate superconductors display a plethora of complex phases as a
function of temperature and carrier concentration, the understanding
of which could provide clues into the mechanism of superconductivity.
For example, when about one-eighth of the conduction electrons are
removed from the copper oxygen planes in cuprates such as La2-xBaxCuO4
(LBCO), the doped holes (missing electrons) organize into
one-dimensional stripes (1). The bulk superconducting transition
temperature (Tc) is greatly reduced, and just above Tc, electrical
transport perpendicular to the planes (along the c axis) becomes
resistive, but parallel to the copper oxygen planes, resistivity
remains zero for a range of temperatures (2). It was proposed a decade ago (3)
that this anisotropic behavior is caused by 
pair density waves (PDWs); superconduct- 
ing Cooper pairs exist along the stripes 
within the planes but cannot tunnel to the 
adjacent layers. On page 575 of this issue, 
Rajasekaran et al. (4) now report detection 
of this state in LBCO using nonlinear reflec- 
tion of high-intensity terahertz (THz) light. 
In LBCO, charge and spin ordering coex- 
ist in the stripe phase. Within the plane, 
stripes are separated by regions with insu- 
lating antiferromagnetic spin order. In the 
adjacent plane, this pattern is rotated by 
90°. The PDW state predicted by Berg et al. 
(3) is based on this intricate packing of the 
stripes that cancels the interlayer coupling 
confining the superconductivity to two dimensions. Hamidian et al. (5)
captured this elusive order using a scanning Josephson tunneling
microscope in Bi2Sr2CaCu2O8+x below Tc. Whether this state also exists in
LBCO above Tc has been an open question.
The dynamics of paired electrons in a 
superconductor can be described by the su- 
perconducting order parameter. Any spatial 
change of phase will result in a current with- 
out dissipation. A current will flow between 
two superconducting regions with different 
phases if separated spatially by a thin layer 
of insulator (a Josephson junction), and the 
current through the thin insulating layer 
is a nonlinear function of the phase difference (see the figure, left
panel). In the case of cuprates below Tc, superconducting CuO2 planes are
separated by insulating regions that allow the interlayer tunneling of
Cooper pairs by means of the Josephson effect. Below Tc, a plasma
resonance appears in the c-axis
optical properties because of this Josephson 
coupling. Such optical properties are not 
seen in the normal state of underdoped cu- 
prates because c-axis transport is resistive. 
For the proposed PDW state in the stripe phase, Josephson coupling is
highly suppressed between the planes and reduces bulk Tc and the plasma
resonance. Rajasekaran et al. used THz light pulses to study the c-axis
reflectivity of LBCO in samples with dopings of 9.5% (Tc = 34 K) and
15.5% (Tc = 32 K), where superconductivity coexists with a weak charge
order that also vanishes near Tc (6). They observed the plasma edge in
the c-axis reflectivity in the superconducting state. As the intensity
of THz light increased, another feature arose at a frequency three times
higher than that of the incident light from the inherent nonlinearity of
the Josephson effect. Both of these features disappeared at Tc. For the
sample with 11.5% doping, in which strong charge order persisted up to
the charge-ordering temperature Tco = 55 K and Tc was suppressed down to
13 K (see the figure, right panel), a third harmonic peak was again
seen, but it persisted up to Tco.

No bulk superconductivity exists above Tc, so another phenomenon must be
responsible for this feature. Rajasekaran et al. show that the PDW state
proposed by Berg et al. (3) can give rise to this observed nonlinear
effect. The specific stacking of the stripes results in a lattice of
Josephson junctions with alternating phases. This lattice supports two
different modes of tunneling currents with different spatial symmetry.
Both of these modes are optically silent and cannot be seen with linear
optics. However, when the intensity of the THz light is increased,
nonlinearity of the Josephson coupling mixes these two modes in a
special way, producing a signal at the third harmonic of the incident
THz field. Through this mechanism, the third-harmonic signal survived up
to Tco without bulk superconductivity.

Josephson junctions and the pair density wave (PDW)
Current-voltage (I-V) nonlinearity in the Josephson effect enables detection of stripe-phase PDW.
Left panel: Current nonlinearities in Josephson junctions. A thin insulator (IN) between two superconductors (SC) creates odd current harmonics.
Right panel: Pair density wave in underdoped LBCO. Superconductivity develops along the stripes. However, special ordering of the stripes decouples neighboring planes up to linear order, preventing bulk superconductivity. A high-intensity THz pulse can probe the nonlinearities in interlayer Josephson coupling.
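
The origin of the odd harmonics can be illustrated with a toy model. The sketch below assumes a single Josephson junction whose current follows I = Ic sin(φ), with the phase driven sinusoidally at one frequency. It is not the coupled junction lattice analyzed by Rajasekaran et al., and the function name and the dimensionless drive amplitude are hypothetical choices made for illustration.

```python
import math


def harmonic_amplitude(drive_amplitude, n, samples=2048):
    """Sine-Fourier amplitude of the n-th harmonic of I(t)/Ic = sin(a * sin(wt)).

    The phase is assumed to follow the drive, phi(t) = a * sin(wt), and the
    coefficient is obtained by projecting one drive period onto sin(n * wt).
    """
    total = 0.0
    for k in range(samples):
        theta = 2.0 * math.pi * k / samples
        current = math.sin(drive_amplitude * math.sin(theta))
        total += current * math.sin(n * theta)
    return 2.0 * total / samples


for a in (0.1, 0.2, 0.4):
    first = harmonic_amplitude(a, 1)
    second = harmonic_amplitude(a, 2)
    third = harmonic_amplitude(a, 3)
    print(f"a={a}: 1st={first:.3e}  2nd={second:.1e}  3rd={third:.3e}")
```

In this toy model the even harmonics vanish and the third harmonic grows roughly as the cube of the drive amplitude, which is consistent with the feature appearing only as the THz intensity is increased.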

Nonlinear THz measurements performed 
with ultrafast laser pulses reveal the PDW or- 
der in LBCO as predicted by theory a decade 
ago. However, how these ultrafast measure- 
ments should be compared to the static mea- 
surements is still an open question. A next 
step is to use this technique to study charge 
orders seen in other cuprates (7). These re- 
sults also reaffirm the usefulness of nonlinear 
light-matter interaction in revealing hidden 
orders in quantum materials.

ASTRONOMY


Galaxy motions cause trouble for cosmology 


Satellite galaxy motions around a nearby galaxy seem to be at odds with dark matter models 


By Michael Boylan-Kolchin 


According to the widely accepted dark energy plus cold dark matter
(ΛCDM) model, dark matter is responsible for both the growth of
cosmological structures and the motions of galaxies relative to the
expansion of the universe.
The dynamics of small galaxies orbiting 
larger ones provides a crucial window into 
this mysterious dark matter, which leaves its 
gravitational mark throughout the universe 
but has not yet been detected directly. On 
page 534 of this issue, Müller et al. (1) describe observations of
satellite galaxies around Centaurus A, the largest galaxy system in the
vicinity of the Milky Way. The results may lead to either a better
understanding of galaxy formation within the ΛCDM model or a push to
overthrow its underlying assumptions.
Although the fundamental nature of both 
dark matter and dark energy remains elusive, the ΛCDM model has proved
to be a remarkably effective framework for understanding the large-scale
distribution of matter and energy from the earliest cosmic epochs to the
present day (2, 3). ΛCDM is also the basis of modern theories of galaxy
formation, and cosmological simulations 
using the model can produce virtual uni- 
verses that match the observed universe 
both broadly and in detail (4, 5). 

However, ΛCDM model predictions do not agree with observations of the
motions of small satellite galaxies around the Milky Way and its nearby
companion, the Andromeda galaxy (6, 7). According to ΛCDM, the dark
matter surrounding each of these galaxies 
should be roughly spherically distributed, 
yet both the Milky Way and Andromeda ap- 
pear to be surrounded by planar structures of 
satellites. In the Milky Way, where measure- 
ments of the full, three-dimensional veloc- 
ity of a satellite can be made in many cases, 
several satellites also share a common sense 
of rotation. Cosmological simulations can 
produce thin planes of satellite galaxies, but 
these planes are chance alignments rather 
than dynamically stable configurations. Reproducing stable, rotating
planes has proved elusive in ΛCDM (8).

Is this discrepancy a reason to abandon ΛCDM and search for an
alternative model? Some scientists have pointed to satellite planes as
evidence that the very idea of dark matter is fundamentally incorrect
and that effects currently ascribed to dark matter are instead the result
of a modification to gravity (9).

Do planes of satellites defy the standard cosmological model?
The Centaurus A galaxy is surrounded by a plane of satellites, which defies naive interpretations of the standard
cosmological model. A possible reconciliation of theory and observation lies in correlations induced by the
large-scale distribution of matter. Galaxies trace high-density filaments of dark matter; these filaments might
be responsible for the observed planes of satellites in the nearby universe.
Centaurus A (half-light radius: 25,000 light-years) appears to host a plane of satellite
galaxies, much like the Milky Way and Andromeda. Legend: approaching galaxies; receding galaxies; not part of the plane.
The large-scale distribution of matter in the TNG300 numerical
simulation is shown. Light (dark) colors denote regions of high
(low) dark matter density. 1 Mpc = 3.26 × 10^6 light-years.

Most astrophysicists remain unconvinced 
by this argument, however. The evidence 
for cold dark matter—or something that be- 
haves very much like it—is very strong on 
various astrophysical scales and over a range 
of cosmic times. Modifying the laws of grav- 
ity is a straightforward idea but turns out to 
be extremely difficult in practice when con- 
strained by the wealth of data in support of 
dark matter and dark energy. Furthermore, 
many astronomers have been concerned 
about drawing conclusions from the near- 
est galaxy systems: The census of Milky Way 
satellite galaxies might be affected by the 
gas and stars in the Galaxy’s disk, and it is 
not currently possible to measure motions 
perpendicular to the plane of satellites in 
Andromeda, meaning its long-term stability 
remains unknown. 

The system that Müller et al. report on, Centaurus A,
is dynamically isolated from
both the Milky Way and Andromeda, allow- 
ing them to test for planar structures in the 
galaxy distribution around a system that is 
not affected by either of these two galaxies. 
Their results strongly point to the existence 
of correlated structure in satellite galaxies 
around Centaurus A (see the figure). These 
findings make it less plausible that the obser- 
vations in all three galaxy systems—the Milky 
Way, Andromeda, and Centaurus A—arise 
from statistically rare configurations. 

Nevertheless, there are good reasons to 
be cautious when interpreting the data. 
In ΛCDM, the distribution of structures in
nearby galaxies is not truly independent: 
It also depends on the distribution of mat- 
ter on larger scales of approximately 10 to 
20 times the distance between Earth and 
Centaurus A. The large-scale distribution of 
matter that is specific to our cosmic neigh- 
borhood might induce features similar to 
those observed. Cosmological ΛCDM simulations
typically would not capture these
system-specific effects. The planes in An- 
dromeda and Centaurus A appear to have 
a common cosmic orientation, providing 
support for the idea that large-scale structure
is responsible for the existence of the
observed satellite planes. 

Furthermore, only a subset of the satellites 
in each of the three systems (the Milky Way, 
Andromeda, and Centaurus A) lie in planes. 
Could the observations of satellite planes be 
a result of astronomers looking too hard for 
any sign of nonuniform structure in the dis- 
tribution of satellite galaxies and declaring 
success upon finding it? 

Müller et al. argue that this is not the case.
Fortunately, a definitive resolution may be 
in sight. The major uncertainty in assessing 
the origin of the observed planar structures 
is the perpendicular (out-of-plane) motion of 
the galaxies: True corotation, indicating a dy- 
namically stable plane, would require these 
velocities to be minimal. If the perpendicular 
velocities are instead comparable to the in- 
plane velocities, then the observed configu- 
rations are likely to be chance alignments. 
Obtaining the requisite tangential velocities 
is observationally prohibitive for galaxies in 
Centaurus A, but detecting (or excluding) 
large tangential velocities in the Andromeda 
system is firmly within the grasp of the Hub- 
ble and James Webb Space Telescopes over 
the next few years. 

Steven Weinberg once noted that “Physics 
thrives on crisis” (10). In this light, the cur- 
rent problem for astronomers and physicists 
is arguably the lack of a major crisis. They 
are in the uncomfortable position of having 
a theory, ΛCDM, that explains most astronomical
observations by invoking a postulated
material that has eluded increasingly
sophisticated attempts at detection. 

The distribution of satellites around nearby 
galaxies does not yet rise to the level of a cri- 
sis driving us to a new understanding, but the 
results of Müller et al. raise the stakes. Perhaps
most excitingly, any potential resolution
of the puzzle of satellite planes is interesting: 
At worst, we improve our understanding 
of galaxy formation; at best, we are led to a 
deeper understanding of the laws of physics. 
If this and other issues related to small galaxies
(11) do not constitute a crisis, they may
have to suffice until the crisis gets here. 


REFERENCES 


1. O. Müller, M. S. Pawlowski, H. Jerjen, F. Lelli, Science 359, 534 (2018).
2. S. White, M. Rees, Mon. Not. R. Astron. Soc. 183, 341 (1978).
3. V. Springel, C. Frenk, S. White, Nature 440, 1137 (2006).
4. M. Vogelsberger et al., Mon. Not. R. Astron. Soc. 444, 1518 (2014).
5. J. Schaye et al., Mon. Not. R. Astron. Soc. 446, 521 (2015).
6. D. Lynden-Bell, Mon. Not. R. Astron. Soc. 174, 695 (1976).
7. R. A. Ibata et al., Nature 493, 62 (2013).
8. T. Buck, A. Dutton, A. Macciò, Mon. Not. R. Astron. Soc. 460, 4348 (2016).
9. P. Kroupa, C. Theis, C. Boily, Astron. Astrophys. 431, 517 (2005).
10. S. Weinberg, Rev. Mod. Phys. 61, 1 (1989).
11. J. Bullock, M. Boylan-Kolchin, Annu. Rev. Astron. Astrophys. 55, 343 (2017).


10.1126/science.aar7375 




GENOMICS 


Tracing single-cell histories 


Origins of mutations in single cells during human brain 
development and aging are revealed 


By Je H. Lee 


DNA mutations accumulate at a steady pace across the human genome,
passing from one generation to another. On the basis of the degree of
shared mutations, a genealogical relationship can be reconstructed from
ancient and modern individuals, allowing one to go back hundreds of
thousands of years in human evolutionary history (1). Instead of comparing
individuals, on pages 550 and 555 of this 
issue, Bae et al. (2) and Lodato et al. (3), 
respectively, assessed the rate of DNA mu- 
tation in single cells from developing and 
aging human brains, revealing mutational 
histories in neurodevelopment, aging, and 
neurodegeneration. These approaches 
also have implications for understanding 
complex diseases that could result from 
somatic mutations that arise later in life, 
such as cancer. 

De novo mutations in the DNA of egg or 
sperm can be associated with devastating 
disorders affecting young individuals (4). 
These germline mutations are widely de- 
tected because all cells in the body inherit 
them. By contrast, somatic DNA mutations 
sporadically occur throughout the life of 
an organism (that is, postzygotically) as a 
result of DNA damage and errors in DNA 
replication or repair. When somatic muta- 
tions occur early in life in dividing cells, 
they are found in a large number of cel- 
lular descendants. If mutations occur in 
dividing cells as humans age, they are 
found in only a limited number of cells, 
resulting in tissue mutational mosaicism 
(see the figure). The inheritance pattern of 
mutations in cells within a tissue can be 
used to establish a temporal or genealogi- 
cal relationship of mutations to better un- 
derstand the role of mutational mosaicism 
in human diseases. Bae et al. analyzed 
the rate and origin of somatic mutations 
in the brain occurring before birth that 
could predispose to functional alterations, 
including neuropsychiatric or neurodevel- 
opmental disorders. Lodato et al. address 
whether mutations continue to occur later 
in life and whether DNA mutagenesis in nondividing cells is associated
with neurodegenerative disorders.
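
The link between when a mutation arises and how many cells carry it can be sketched with an idealized lineage. The example below assumes perfectly symmetric cell divisions with no cell death, which real tissues do not follow, and the function names are hypothetical; it is meant only to show why early mutations are widely shared while late ones form a fine-grained mosaic.

```python
def carrier_count(total_divisions, origin_division):
    """Number of final cells carrying a mutation that arose in a single cell
    after `origin_division` rounds of division (0 means the zygote itself)."""
    if not 0 <= origin_division <= total_divisions:
        raise ValueError("origin_division must lie between 0 and total_divisions")
    return 2 ** (total_divisions - origin_division)


def carrier_fraction(total_divisions, origin_division):
    """Fraction of the final cell population that inherits the mutation."""
    return carrier_count(total_divisions, origin_division) / 2 ** total_divisions


# Idealized lineage of 20 division rounds (about one million cells).
for origin in (0, 1, 5, 20):
    print(origin, carrier_fraction(20, origin))
```

In this idealization the carrier fraction halves with every round of division that precedes the mutation, so a mutation arising in the final round is private to one cell.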

To study somatic mutations that arise 
in development or during aging, an accu- 
rate single-cell whole-genome sequencing 
(WGS) method is necessary. This method 
requires sensitive whole-genome amplifica- 
tion techniques (5); however, DNA nucleo- 
tide bases are susceptible to damage (which 
can lead to mutation), and enzymes used in 
amplification introduce additional errors. 
Indeed, the number of false-positive single- 
nucleotide mutations [also referred to as 
single-nucleotide variations (SNVs)] can be as high as 10^4 in
single-cell WGS (5), vastly outnumbering naturally occurring SNVs
(10^2 to 10^3 per cell). Even in cancer or population genome-sequencing
projects, mutagenic DNA damage can be a major source of
sequencing errors for rare SNVs (6). 

Building on previous methodology (7), 
Bae et al. addressed this challenge by 
adopting a single-cell cloning method using 
cultured neuronal precursors from three 
human fetal brains at 20 weeks of gesta- 
tion. If a cell contains a true heterozygous 
mutation, the wild-type and mutant alleles 
in the genome should each comprise ~50% 
of the cell population, even after multiple 
cell divisions. If mutations occur during 
cell culture or DNA amplification, the fre- 
quency of the heterozygous mutation drops 
to less than 25%. With this criterion, Bae et 
al. sequenced single cells from 31 clonal cul- 
tures, identifying somatic mutations by per- 
forming clone-to-clone and clone-to-tissue 
comparisons. This reduced the rate of false 
positives to 5% and false negatives to 17% in 
single-cell clones. 
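
The allele-frequency criterion described above can be expressed as a simple filter. This is a minimal sketch of the idea only, not the pipeline used by Bae et al.: the function names and labels are hypothetical, the 25% cutoff is taken from the description in the text, and a real analysis would also model sequencing depth and sampling noise.

```python
def variant_allele_fraction(alt_reads, total_reads):
    """Fraction of reads at a site that support the variant allele."""
    if total_reads <= 0:
        raise ValueError("total_reads must be positive")
    return alt_reads / total_reads


def classify_variant(alt_reads, total_reads, artifact_cutoff=0.25):
    """Label a variant seen in a clonal culture.

    A true heterozygous mutation in the founder cell should sit near 50%;
    variants arising during culture or amplification fall below the cutoff.
    """
    vaf = variant_allele_fraction(alt_reads, total_reads)
    if vaf < artifact_cutoff:
        return "likely culture or amplification artifact"
    return "candidate true heterozygous mutation"


print(classify_variant(48, 100))
print(classify_variant(12, 100))
```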

Subsequently, they found 200 to 400 mosaic SNVs in cultures of each
neuronal precursor, with the most common substitutions being C-to-A
transversions (a common result of oxidative damage) and
5-methylcytosine-to-thymine transitions (common in germline SNVs).
Assuming a linear increase in somatic 
mutations over time, they estimate 8.6 
mutations per cell division, a signifi- 
cant increase compared to mutation 
rate estimates in early postzygotic cell 
divisions (1.3 mutations per cell divi- 
sion) (8, 9). Extending these estimates 
to regions where neurogenesis contin- 
ues throughout life (for example, in 
the hippocampus), Bae et al. hypoth- 
esize that mutational mosaicism aris- 
ing from dividing neuronal precursors 
over time may alter neuronal function in the mature brain for processes
such as memory and may thus contribute to neurodegeneration.

Lodato et al. took a different approach. Examining postmitotic neurons,
they used an algorithm
to extrapolate the frequency of so- 
matic mutation using nearby known 
germline SNVs. With this method, 
mutations caused by DNA dam- 
age or amplification errors occur on 
one DNA strand but not the other, 
whereas naturally occurring  so- 
matic mutations are present on both 
strands. Although only mutations 
near germline SNVs can be discov- 
ered this way, it does not require cell 
culture or clonal expansion of individual 
cells, as in other approaches (2, 7). 

Lodato et al. carried out single-cell WGS 
on 93 normal postmitotic prefrontal cortex 
(PFC) neurons in individuals ranging from 
4 months to 82 years of age, 26 normal 
hippocampal dentate gyrus neurons, and 
42 PFC neurons from patients with DNA 
damage repair-deficient neurodegenera- 
tive disorders. Notably, they found 300 to 
900 SNVs per cell in PFC neurons within 1 
year of birth, consistent with the estimate 
of Bae et al. In addition, they observed a 
twofold higher rate of somatic mutations 
in hippocampal dentate gyrus neurons 
(with 40 mutations occurring per cell each 
year) than in PFC neurons, as was also pro- 
posed by Bae et al. Furthermore, patients 
with DNA damage repair-deficient neuro- 
degenerative disorders showed a twofold 
increase in somatic mutations in postmi- 
totic neurons across different age groups. 
They also detected three distinct muta- 
tional signatures in all of the cells studied, 
suggesting multiple mechanisms of so- 
matic mutation associated with brain de- 
velopment, aging, and neurodegeneration. 
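
The figures quoted above can be combined into a rough linear "mutational clock". The sketch below is an illustration under stated assumptions, not a model from Lodato et al.: the baseline of 600 SNVs is simply the midpoint of the 300 to 900 range reported near birth, the PFC rate of about 20 per year is inferred from the twofold difference relative to the dentate gyrus rate of 40 per year, and the function name is hypothetical.

```python
def expected_snvs(age_years, baseline, rate_per_year):
    """Expected somatic SNVs per neuron under a linear accumulation model."""
    if age_years < 0:
        raise ValueError("age_years must be non-negative")
    return baseline + rate_per_year * age_years


# Assumed, illustrative parameters (see the caveats in the text above).
BASELINE_AT_BIRTH = 600  # midpoint of the 300 to 900 range
PFC_RATE = 20            # per year, inferred as half the dentate gyrus rate
DG_RATE = 40             # per year, as quoted for dentate gyrus neurons

for age in (1, 40, 82):
    print(age,
          expected_snvs(age, BASELINE_AT_BIRTH, PFC_RATE),
          expected_snvs(age, BASELINE_AT_BIRTH, DG_RATE))
```

Under these assumptions the mutations acquired over a lifetime outnumber those present at birth, which is the sense in which single-cell counts can serve as a clock.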

Although these studies do not provide definite mechanisms of how somatic
mutations occur or establish their functional consequences, it is
exciting that Bae et al. and Lodato et al. utilized readily accessible
methods to accurately sequence single-cell whole genomes and
reconstructed mutation histories (akin to a “mutational clock”) with
single-cell resolution. Because recent mutations are restricted to the
tiniest mosaic pieces (see the figure), mutational clocks had previously
been limited to the early mutation events shared by many cells. These
technological advances in single-cell WGS allow for examining recent,
uncommon mutational events specific to each cell, considerably widening
the temporal breadth of mutation-history reconstruction.

Reconstructing cellular mutation history
Mutations can be present in most cells or in just one cell within
a tissue, depending on when they occur. Emerging sequencing
technologies can accurately profile genome-wide somatic
mutations, even in the tiniest mosaic piece, a single cell.
[Figure labels: Germ line; Aging; Time of mutational origin; Mutational mosaic]

In so doing, fundamental biological ques- 
tions that had been inaccessible can now be 
addressed, especially regarding the onset 
and progression of relatively recent somatic 
mutations in development or aging. Indeed, 
these studies now firmly establish the pres- 
ence of ongoing DNA damage later in life 
that accelerates in neurodegeneration (3). 
They also demonstrate the occurrence of 
coding and noncoding mutations that arise 
later during neurogenesis (2), potentially 
dividing the brain into smaller mutational 
territories, with implications for brain con- 
nectivity, function, and predisposition to 
neurological disorders.

The accuracy of single-cell WGS
continues to improve, including 
new amplification methods with a 
lower error rate and more uniform 
coverage of the genome (5). Unfor- 
tunately, single-cell WGS is limited 
to a small number of cells because of 
throughput and cost bottlenecks. To 
accurately reconstruct mutation his- 
tory and mosaicism throughout life, 
it is necessary to trace cell lineages 
comprehensively. Recently, somatic 
alterations induced by members of 
the CRISPR-associated (Cas) family 
of DNA nucleases have been used to 
reconstruct cell lineages in model organisms
and cell lines (10, 11). In this
method, Cas9-induced alterations 
within a single synthetic locus can be 
traced. Such reporters are transcrip- 
tionally active, making them compat- 
ible with high-throughput single-cell 
RNA sequencing (12) to “read” the 
reporter and identify cell type-spe- 
cific RNA signatures. Indeed, re- 
porters can even be programmed to 
record sequential events in cells (13), 
potentially revealing environmental 
interactions that shape the tissue hi- 
erarchy and cell lineages within. 

One day, it might be possible to en- 
gineer cell lineage and activity record- 
ers throughout an entire organism, 
followed by comprehensive single-cell 
sequencing of recorders containing induced 
mutations. It might then be possible to 
compare naturally occurring somatic muta- 
tions in a subset of single cells using WGS 
across specific developmental stages. This 
could reveal biological processes that affect 
the mutational clock across different tissues 
or cellular activities and expand our under- 
standing of mutational signatures (14, 15), 
especially if such signatures are conserved 
across species. Precisely answering when, 
how, and where mutational events occur 
in cells in different tissues will help clarify 
important questions in development, aging, 
and disease research. 
CORPORATE SUSTAINABILITY 


Scientifically assess impacts 
of sustainable investments 


Metrics can inform investors wary of “green washing” 


By C. J. Vörösmarty, V. Rodriguez Osuna, D. A. Koehler, P. Klop, 
J. D. Spengler, J. J. Buonocore, A. D. Cak, Z. D. Tessler, F. Corsi, 
P. A. Green, R. Sanchez 


The practice of selecting and managing 
financial assets based on their social 
and environmental performance is 
undergoing rapid growth and fundamental 
change. Investors are increasingly 
pressed by asset owners to prove 
how one company’s practices are materially 
more or less sustainable than those of an- 
other. Yet, the basic information that com- 
panies declare is hardly standardized and 
is difficult to verify, with unreliable assertions 
(1) that are widely criticized as “green 
washing.” Metrics are mainly restricted to 
documenting changes to internal 
business practices but offer limited 
guidance on whether a company’s 
actions, products, and services pro- 
mote human well-being or preserve 
environmental integrity in the ex- 
ternal, real-world domain, fueling 
reluctance on the part of otherwise 
enthusiastic investors (2, 3). It is here 
where science can play an important 
role. Our consortium of an asset owner, an 
asset manager, and two research universi- 
ties is designing a next generation of trace- 
able indicators to quantify external context 
and impact of investments and place these 
into a decision-making framework useful to 
investors. Tests of these science-based sus- 
tainability metrics are under way on a $2.1 
billion portfolio of public equities invested 
on behalf of a large European pension fund. 
Estimated growth in assets under pro- 
fessional management for sustainable in- 
vestment is well above market growth rates 
worldwide (2, 3). In the United States, sus- 
tainable investments grew 33% from 2014 
to 2016 and now total nearly $9 trillion, or 
about one-fifth of all professionally managed 
assets (4). Several initiatives have emerged to 
help guide sustainable investing, including 
the Principles for Responsible Investment, 
Sustainability Accounting Standards Board, 
and Global Reporting Initiative. While such 
efforts’ primary focus on minimizing a com- 
pany’s operational and supply chain risks 
is important—e.g., via emission reductions, 
efficiency gains, or improved worker health 
and safety—such evaluation procedures fall 
far short of what is needed for true verifica- 
tion of company-level sustainability perfor- 
mance and may produce assessments and 
investment decisions that are at odds with 
sustainable development. 




Although the idea of linking actions to en- 
vironmental and social impacts is not new 
(e.g., in economics and policy analysis), its 
broader value to financial decision-making, 
in a way that makes sense to investors, has 
yet to be fully embraced. Many large, publicly 
traded companies are actively developing 
and putting to work technologies that create 
business opportunities going well beyond 
compliance per se, as with the explosion of 
energy- and life-saving innovations in the 
21st-century automobile, whose genesis can 
be traced to enforcement of the U.S. Clean Air 
Act of 1970 (5). Nevertheless, the positive so- 
cial and environmental impacts of individual 
companies and their business activities have 
proven difficult to systematically evaluate. 


SEEKING CONTEXT 


One active test bed for assessing corporate 
sustainability performance is in carbon 
emissions management, with various proto- 
cols in place (e.g., Carbon Disclosure Project, 
Science-Based Targets). Given the well-mixed 
nature of the atmosphere, emitting one less 
ton of carbon should yield an identical pos- 
itive impact regardless of the source. Thus, 
estimating climate change mitigation bene- 
fits becomes the relatively straightforward 
task of crediting the efficiency of a compa- 
ny’s operations in avoiding emissions, often 
normalized to revenue, but with little atten- 
tion paid to how those operations contrib- 
ute to other environmental risks or how its 
products and services ultimately generate 
positive or negative impact (6). 

Carbon accounting will have limited 
usefulness in the broader development 
agenda, like that represented by the United 
Nations Sustainable Development Goals 
(SDGs), where climate change mitigation 
is but one of 17 focal points. The SDGs re- 
quire additional metrics on human health, 
access to clean water, poverty alleviation, 
biodiversity conservation, and many others. 
Positive contributions can no longer be con- 
sidered to operate in a well-mixed box, as 
for climate. Given the social and economic 
disparities that the SDGs seek to redress, 
investment strategies need to target par- 
ticular human beneficiaries or environmen- 
tal systems. What is needed is the notion 
of context, by which a company’s business 
model or manufacturing practices generate 
real-world impacts. 

Context-oriented metrics move 
the investment process one step 
closer to identifying actual solu- 
tions, which tabulations of modified 
corporate operations alone do not 
necessarily uncover. The approach, 
then, is to go beyond documenting 
changes in facility-scale outputs and 
combine them with what we term 
outcomes and impacts, which by their very 
nature evaluate corporate products and ser- 
vices within broader environmental or hu- 
man beneficiary settings. Such an expanded 
view of business practices, services, or prod- 
ucts consumed begins to uncover counter- 
intuitive results that are often masked by a 
singular focus on outputs. 

For example, for publicly held companies 
selling renewable energy technologies, reductions 
in CO₂ and other air pollutant emissions 
at the smokestack (outputs) produce ambient 
air quality improvement (outcomes) and as- 
sociated health benefits (impacts). However, 
the tabulated impacts are often poorly cor- 
related with the originating outputs, espe- 
cially if the investment choices embody large 
differences in plant technologies, the local 
electrical grid, and potential beneficiary pop- 
ulations living downwind (7). This means that 
a small renewable energy company in a region 
with high baseline levels of air pollution 
could generate greater positive impact on hu- 
man health than a larger company operating 
in a region where the air is cleaner and the 
electrical grid emits less pollution per mega- 
watt-hour generated. Apart from its financial 
performance, that smaller company may be 
passed over as a good candidate for inclusion 
within a sustainability portfolio if that judg- 
ment were based solely on its much lower 
outputs of clean energy at the plant level. 
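
The contrast between the two companies can be made concrete with a toy calculation. The sketch below is illustrative only: the company names, generation figures, grid emission factors, and downwind populations are hypothetical placeholders rather than values from the pilot study, and the linear impact score is an assumed stand-in for a real air quality and health model.

```python
from dataclasses import dataclass


@dataclass
class Holding:
    """One hypothetical renewable energy company in a portfolio."""
    name: str
    clean_mwh: float          # output: clean energy delivered per year (MWh)
    grid_kg_per_mwh: float    # context: pollutant emitted per MWh by the local grid
    downwind_millions: float  # context: exposed population downwind (millions)


def avoided_emissions_kg(h: Holding) -> float:
    """Outcome proxy: pollutant mass displaced from the local grid."""
    return h.clean_mwh * h.grid_kg_per_mwh


def health_impact_score(h: Holding) -> float:
    """Impact proxy: avoided emissions weighted by the exposed population.

    Assumed linear form; a real assessment would use an air quality
    and epidemiological model instead.
    """
    return avoided_emissions_kg(h) * h.downwind_millions


# Hypothetical holdings: a small plant on a dirty grid near a large
# population, and a larger plant on a clean grid with fewer people downwind.
small = Holding("SmallCo", clean_mwh=100_000, grid_kg_per_mwh=0.50, downwind_millions=5.0)
large = Holding("LargeCo", clean_mwh=400_000, grid_kg_per_mwh=0.05, downwind_millions=1.0)

for h in (small, large):
    print(h.name, h.clean_mwh, round(health_impact_score(h)))
```

Ranked by output alone, the hypothetical LargeCo comes first; ranked by the context-weighted score, SmallCo does, which is the kind of reversal the text describes.
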
Similarly, we have been designing human health impact scores for 
pharmaceutical companies. The monotherapy drug effectiveness from 
U.S. Food and Drug Administration-approved clinical studies is 
combined with the number of prescriptions per drug, epidemiological 
information from peer-reviewed articles, and official occupational 
health reports for related diseases to estimate life spans extended, 
hospitalizations averted, and sick days prevented by every drug. 
This information is then aggregated to the national scale, by 
company, to estimate overall health impact. 

Because companies typically fail to disclose many of the details of 
what products they sell, where, and in what quantity, we are not yet 
able to explicitly link an investment dollar to a specific yield of 
impacts across an entire portfolio of large global companies. 
However, the causal chains linking an action to an impact have been 
well enough established in many cases for us to adopt some basic 
assumptions to compute corporate contributions to sustainability—for 
example, knowing that CO₂ and other emissions from power plants 
contribute to climate change and bear negative human health effects 
and that renewables reduce both emissions and their climate and 
health burdens. Until there is more comprehensive disclosure of 
information, however, estimating the benefits that a company’s 
products and services accrue thus requires some pragmatic, 
simplifying assumptions (see the table). 


Context-based metrics for investment decisions 

Examples of metrics applied to holdings from the pension fund 
portfolio tested. Current data limitations necessitate practical 
simplifications that affect accuracy and precision. Access to more 
precise data on corporate products and services would enable the 
impact metrics to more fully capture environmental and public health 
context. The framework can adopt additional modules, representing 
alternative state-of-the-art impact assessment models and/or their 
outputs. 

ASSESSMENT MEASURES: Impact metrics 
Data source: Combined business activity/technology/ 
sociobiogeophysical impact metrics, as proposed here 

  WASTEWATER TREATMENT 
  Outputs: Volumes of wastewater treated (e.g., cubic meters per day) 
  Outcomes: Pollutant concentration reductions in receiving waters 
  (e.g., milligrams per liter) 
  Impacts: Beneficiary populations drawing drinking water from 
  downstream (1000s), stream length of improved fish habitat 
  (kilometers below threshold) 

  RENEWABLE ENERGY 
  Outputs: Emissions reduced (CO₂, PM2.5, SO₂, NOx) (e.g., metric 
  tons per year) 
  Outcomes: Ambient air pollution improvement (e.g., parts per 
  million; number of smog alerts) 
  Impacts: Lives extended, hospitalizations and sick days avoided, 
  reduced health-care spending 

  PHARMACEUTICALS 
  Outputs: Spatially distributed drug sales 
  Outcomes: Target populations reached (e.g., millions; % of total) 
  Impacts: Lives extended, hospitalizations and sick days avoided, 
  reduced health-care spending 

ASSESSMENT MEASURES: Financial (standards-based) 
Data source: Mandated corporate financial statements 
  Measures: Valuation; corporate strategy and relative competitive 
  position 

ASSESSMENT MEASURES: Environmental, social, and governmental norms 
(nonfinancial performance) 
Data source: Voluntary or mandated corporate self-disclosure, 
sustainability reports 
  Measures: Environmental compliance; governance and ethical policies 


In our pilot work, we use relatively abundant revenue data as a 
proxy for business activity at the country level, based on available 
corporate disclosures. We link these to estimates made by impact 
models operating on a geographical scale that captures the particular 
relevant health and environmental context. For example, the 
wastewater treatment impact model operates regionally with product 
deployments that are assessed with respect to contrasting dilution 
potentials in receiving waters. Health benefits are computed using 
country-level revenue information prorated by population. 
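
One plausible reading of the revenue proxy is a simple proportional allocation: when a company discloses revenue by country but not product deployments, its total output is spread across countries according to revenue share. The sketch below assumes exactly that; the function name, country labels, and figures are hypothetical, and the text does not specify the allocation rule actually used in the pilot.

```python
def allocate_by_revenue(total_output: float, revenue_by_country: dict) -> dict:
    """Spread a company-wide output across countries by revenue share.

    Assumption: business activity in a country is proportional to the
    revenue the company reports there.
    """
    total_revenue = sum(revenue_by_country.values())
    if total_revenue <= 0:
        raise ValueError("total revenue must be positive")
    return {
        country: total_output * revenue / total_revenue
        for country, revenue in revenue_by_country.items()
    }


# Hypothetical disclosure: revenue in millions of dollars per country.
revenue = {"Country X": 600.0, "Country Y": 300.0, "Country Z": 100.0}

# Hypothetical company-wide output, e.g. cubic meters of wastewater treated per day.
allocated = allocate_by_revenue(1_000_000.0, revenue)
print(allocated)
```

Each country-level estimate could then be passed to an impact model that supplies the local context, such as dilution potential or the population served.
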


DESIGN AND PRACTICE 
We see it as essential that metrics rely not 
only on self-disclosed corporate reports 
but also on data from recent advances in 
Earth observation and modeling, epidemi- 
ology, public health, nutrition science, dem- 
ographic mapping, and life-cycle analysis of 
resource production and consumption, which 
all have the advantage of being, for the most 
part, transparent, reproducible, and peer-re- 
viewed. Although these technical capabilities 
are supporting breakthroughs in understand- 
ing the complexity and causal links that bind 
human-environment systems, we repurpose 
the science to define context. In some of our 
work on air and water pollution abatement, 
we have been successful in “draping” corpo- 
rate output data onto geospatial Earth system 
and human dimension data sets to compute 
outcome and impact indicators that explicitly 
link corporate manufacturing activities, 
products, and services to environmental and 
human beneficiaries (7, 8). 

Moving a step further, combining such 
measures with reported financial data 
would create intuitive, company-level met- 
rics of value to sustainable investment 
decision-making—for example, using our 
computed impacts per dollar of revenue as 
an indicator of sustainable investment “ef- 
ficiency” to facilitate comparisons across 
companies. This mixture of quantitative in- 
formation brings with it the critical issue of 
data harmonization in terms of thematic, 
spatial, and temporal coherency, but frame- 
works from other domains, such as the Earth 
system modeling framework community (9), 
provide useful guideposts. Protocols being 
developed under the next generation of the 
Global Earth Observation System of Systems 
(NextGEOSS) to accelerate the adoption of 
Earth observations into business practices 
are another such resource. 
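
The "efficiency" indicator described above, computed impacts per dollar of revenue, can be sketched in a few lines. The company labels, impact values, and revenues below are hypothetical, and the impact unit (for example, sick days avoided per year) is an assumption made for illustration.

```python
def impact_efficiency(impact: float, revenue_usd: float) -> float:
    """Impact units generated per dollar of revenue."""
    if revenue_usd <= 0:
        raise ValueError("revenue must be positive")
    return impact / revenue_usd


def rank_by_efficiency(companies: dict) -> list:
    """Return company names ordered from most to least impact per dollar.

    `companies` maps a name to a (impact, revenue_usd) pair.
    """
    return sorted(
        companies,
        key=lambda name: impact_efficiency(*companies[name]),
        reverse=True,
    )


# Hypothetical holdings: (computed impact, annual revenue in dollars).
portfolio = {
    "A": (1200.0, 4.0e9),
    "B": (300.0, 0.5e9),
    "C": (900.0, 6.0e9),
}

ranking = rank_by_efficiency(portfolio)
print(ranking)
```

Normalizing by revenue lets a small company with modest absolute impact rank above a much larger one, which is the cross-company comparison the indicator is meant to support.
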

Context-based metrics must also be able 
to evaluate cross-sectoral synergies, trade- 
offs, and unintended consequences associ- 
ated with multiple sustainability end points 
(e.g., improved water management, human 
health, and biodiversity conservation). A 
company that is highly rated for reducing 
air pollution could simultaneously present 
material risks to occupational health and 
safety, or at the expense of clean water. Con- 
text also helps investors to more clearly de- 
fine a company’s potential reputational risk. 
For example, portfolio managers might au- 
tomatically consider selling off holdings of 
a pharmaceutical company that is depleting 
water in a dry region and risking a major 
public backlash, but that decision might 
sensibly be reconsidered should the facili- 
ties serve large numbers of people in a de- 
veloping country, where the positive health 
benefits per dollar invested in drugs are far 
higher than in wealthier nations (10). 

A first and fundamental step is to estab- 
lish a formal dialogue space for information 
exchange and best practices, organized as a 
strategic alliance of asset owners and man- 
agers, companies, auditors, nongovern- 
mental organizations, and scientists. The 
partnership would emphasize codesign of 
unified output-outcome-impact assessment 
frameworks and catalyze rational debate 
on the specific kinds of corporate disclo- 
sure and scientific data, their reporting 
frequencies, and spatial characteristics. 
In addition, the consortium would create 
momentum for third-party sustainability 
reporting and auditing standards, setting 
the stage for verifiable broad-scale indices. 
Established efforts like the Global Impact 
Investing Network, UN Global Compact, 
Sustainable Development Solutions Net- 
work, and World Business Council for Sus- 
tainable Development provide ready-made 
forums but would need to expand the cur- 
rent dialogue to consider new approaches 
that uncover and measure context. 


NEW BUSINESS MODELS 

To itself be sustainable, sustainable invest- 
ing must ultimately yield financial returns. 
In a global survey of sustainable investment 
organizations (3), 98% of respondents re- 
ported that their investments either met 
or exceeded expectations for social and/or 
environmental performance, whereas 91% 
reported satisfaction with financial per- 
formance. And, despite the willingness of 
some investors to accept below-market 
performance on some of their managed as- 
sets in the name of public good or to seed 
new investment frontiers (3), the assess- 
ment model being tested here assumes that 
sound business practices based on durable 
social or environmental benefits translate 
into profit. This investment strategy re- 
wards a long time horizon—the essence of 
sustainability—in lieu of short-range finan- 
cial performance, which preoccupies much 
of the current market. 

Although our own impact investment 
experiment has just begun, we see early 
promise that the fund being tested could 
ultimately generate returns competitive 
with a broad benchmark while producing 
measurable environmental and health ben- 
efits (11). Before initiating our study, stocks 
had already been selected on the basis of fi- 
nancial valuation; positive performance on 
environmental, social, and governance 
risks in their operations and supply chains; 
and sales of products and services in four 
critical challenge arenas—water, climate 
change mitigation, human health, and food 
security. These impact categories, selected 
by the pension fund trustees themselves, are 
each indispensable to the global economy 
and present considerable environmental 
and social risks. Representative companies 
were also chosen based on how well their 
products and services potentially contrib- 
ute to the SDGs. We are testing our metrics 
on this subset of pension fund assets. 

Our explicit assumption is that financial 
and impact metrics can be combined in a 
portfolio without adversely affecting finan- 
cial returns. Most important, well-designed 
impact metrics for public equity portfo- 
lios can be used effectively to inform asset 
owners, influence stakeholders, and shape 
investment decisions. The global effect of 
this transformation would be consequential 
and would reasonably be expected to save 
trillions of dollars by mitigating climate 
change, the growing water crisis, and wide- 
spread loss of ecosystem services (12-14). 

In the end, a more comprehensive ap- 
proach to impact measurement could be 
realized if the financial sector joins forces 
with the scientific community. The dialogue 
space proposed here could be used to ne- 
gotiate the terms of engagement and con- 
front difficult carrot-and-stick issues such 
as that regarding corporate data disclosure. 
If one agrees with the wisdom of mandated 
financial disclosures such as under the U.S. 
Securities and Exchange Commission, built 
on well-established accounting principles 
and standards, then it is not unreasonable 
to warrant authentication of nonfinancial 
information that investors would find use- 
ful. In fact, more than 50 stock exchanges 
worldwide (e.g., Securities and Exchange 
Board of India) encourage or mandate such 
disclosures today (15). Might inclusion of a 
particular company in a sustainable asset 
portfolio be made contingent upon authentication 
of mandatory self-disclosures? If so, 
the reward to that company, beyond direct 
revenue generated, would be a bona fide 
declaration that it is promoting sustainable 
business practices, especially if the impact 
measures show the company outperforming 
its peers or broader benchmarks. The will- 
ingness of corporations to share detailed 
information in such an alliance—as increas- 
ingly demanded by their investors—will be 
as much a shift in corporate culture as in 
how companies report on sustainability is- 
sues. If such a change is realized, we foresee 
a huge investment space opening, based on 
verifiable net positive impacts. 
In the 12 years remaining to achieve the 
UN’s Sustainable Development Goals, we see 
an opportunity to road test this and similar 
context-based frameworks. If sustainable in- 
vestors can adopt suitable impact metrics to 
identify sustainable investment opportuni- 
ties, the community of nations will have in 
the business community a new and reliable 
ally, thereby increasing the likelihood that 
the goals will actually be realized within this 
short, ambitious time frame. 
Picturing the future 


A new film celebrates a mid-century illustrator who stoked 
imaginations with prescient comics 


By Dov Greenbaum 


Closer Than We Think 
Brett Ryan Bonowicz, director 
Clindar, 2017, 85 minutes. 


Directed by Brett Ryan Bonowicz (The Perfect 46, 2014), Closer Than 
We Think is an eponymous documentary about a mid-20th-century comic 
strip and its creator, Arthur Radebaugh. The film provides important 
insight into what made the comic a foundation for other popular 
science fiction endeavors. More important, however, it shows how the 
strip directly encouraged nascent engineers and scientists to 
actualize the future described therein. To this end, Radebaugh’s 
efforts succeeded where so many other science communication efforts 
continue to fail. 

Published weekly from 1958 through 1963 in the Sunday funnies of more 
than 200 newspapers, Radebaugh’s educational futurism provided 
playful, tongue-in-cheek visions of a technological utopia that the 
public could readily understand and appreciate. His predictions were 
based on information that he obtained from researchers, which he 
dutifully recorded in each strip. In one panel, for example, a 
colorful rendering of a solar-powered car accompanies a pithy 
interview with the vice president of the Chrysler Corporation, James 
Zeder. “If we continue to increase the efficiency of these [silicon] 
converters, and if we are able to develop small, efficient energy 
storage cells, solar powered cars will be feasible,” Zeder explains. 
In the future, translates Radebaugh, “[y]our solar sedan will take 
energy from sunrays and store it in accumulators that work like a 
battery.” 


Comic panel text: 

COMPUTER NAVIGATION 

Computers of the future will do far more then solve mathematical 
problems or keep track of inventory. One unusual but practical 
application might concern the safe navigation of aircraft. 

To accomplish this, radar-derived information on all aircraft within 
100 miles (perhaps even more) would be fed into a computer box in the 
plane’s cockpit. The constantly changing courses of these aircraft 
would be recorded and analyzed almost instantly by the computer 
navigator. The instrument would then direct its own plane on a safe 
course untraveled by the others. This would drastically reduce the 
danger of mid-air collisions or near-misses, no matter how clogged 
with traffic the airlanes became. 


Bonowicz’s documentary stresses that Radebaugh’s relationships with 
researchers were central to the success of his predictions, which 
included the VCR (“[W]hen a worthwhile program comes over the air ... 
you’ll be able to preserve both the picture and sound on tape for 
replaying at any time”), Skype and the globalization of the job 
market, and driverless vehicles (“Future trains will be fully 
automatic—robots that can regulate their own speed and control their 
own movements”). 
The film suggests that Radebaugh’s ability 
to connect to the lay public can also be attrib- 
uted to his technological optimism. By reject- 
ing the dystopian tropes common to science 
fiction, he enticed a younger generation to 
become interested in science and technology, 
encouraging them to envision fantastical 
technological futures. This mirrored a grow- 
ing feeling in society: Even during the chaos 
and fear of the Cold War—Closer Than We 
Think was bookended by Sputnik and Ken- 
nedy’s assassination—it was also a time of 
growing hope that technology could be har- 
nessed for the betterment of mankind. 
Before Closer Than We Think, Rade- 
baugh was a well-regarded commercial il- 
lustrator who saw the demand for his work 
plummet as photographs began to take 
the place of drawings in print publications 
and television advertising began to obtain 
prominence. He moved to Grand Rapids, 
Michigan, in 1967, ostensibly to retire with 
his wife, Nancy. There, he fell on hard times 
and lived out his days in relative obscurity. 
Bonowicz’s film, although an exhaustive 
examination of Radebaugh’s art and life, 
would have benefited from a more thorough 
exploration of both his scientific muses and 
his sources. A more in-depth exploration 
of those who were influenced by his work 
would also have been worthwhile. 
Radebaugh’s cartoons have an important 
take-home message for scientists: Learn to 
communicate with the public. They are lis- 
tening, and they want to learn. You never 
know who you might influence. 


10.1126/science.aar6193 


Radebaugh’s vivid depictions of soon-to-be technologies resonated with the American public in the mid-20th century. 
MEASUREMENT 


The fetishization of quantification 


A historian explores the dark side of metric-based 
performance evaluation 


By Theodore M. Porter 


The notion of a “metric” as a performance 
measure became familiar 
in the 1970s and 1980s as a tool of 
business management. Almost im- 
mediately, its use was extended to 
the assessment of a range of activities 
and institutions, from medical outcomes 
and educational programs to military proj- 
ects. The credibility of metrics rests in part 
on an affiliation with ideals of business ef- 
ficiency and in part on the supposition that 
measurement is tantamount to science. 
Although the numbers whose “tyranny” 
forms the subject of Jerry Muller’s timely 
book share some of the attributes of scien- 
tific measurement, their purposes are pri- 
marily administrative and political. They are 
designed to be incorporated into systems of 
what might be called “data-ocracy,” often for 
the sake of public accountability: Schools, 
hospitals, and corporate divisions whose 
numbers meet or exceed their goals are to 
be rewarded, whereas poor numbers, taken 
to imply underperformance, may bring pen- 
alties or even annihilation. In The Tyranny 
of Metrics, Muller shows how teachers, doc- 
tors, researchers, and managers are driven to 
sacrifice the professional goals they value in 
order to improve their numbers. 
Scientists these days sometimes contrast 
“data-driven” science with institutional 
routines—and especially with the fake 
news that seems now to run rampant in 
public debate. Yet metrics, when used to 
ease administration, have become grave- 
yards of such naive idealism. Data are 
almost never, as the etymology implies, 
“given,” but have to be actively 
made or taken. The appropriate 
uses of data are subject to in- 
tense discussion and debate and 
depend in most cases on the fram- 
ing and interpretation of models. 

Metrics are typically designed 
for routine administrative use, 
often in a context of political 
disagreement. As Muller shows, 
the expectation that they should 
be transparent leaves little space 
for expert interpreters to adjust 
or explain misleading measures 
of therapeutic effectiveness, school quality, 
research impacts, or even corporate profit- 
ability. Because none of these are concrete 
in a way that would permit rigorous mea- 
surement, such metrics can only function 
as indicators and can lead to terrible conse- 
quences when they are imposed reflexively. 

The American obsession with metrics of 
school performance, which took off in the 
early 1980s and is now increasingly global- 
ized, has probably done more harm than 
good, suggests Muller. Standardized tests are 
unable to capture what is really valuable in 
an education, and schools have no control 
over many of the variables that enter into 
measures of student performance. In one specific 
sense, the metrics are counterproductive 
because they encourage a focus on test performance 
rather than on the knowledge that 
a proper curriculum aims to advance. Metrics 
of this sort privilege symbols over reality. 


The Tyranny of Metrics 
Jerry Z. Muller 
Princeton University Press, 2018. 232 pp. 


Overvaluing standardized test scores creates 
perverse incentives for teachers, argues Muller. 


There are, likewise, many well-docu- 
mented instances of needy patients refused 
treatment for the sake of statistics. (“V.A. 
doctors say rating push hurts patient care,” 
proclaimed The New York Times on the front 
page on 1 January 2018.) When doctors or 
hospitals are judged by the success of opera- 
tions on patients with a particular diagnosis, 
they face a strong incentive to avoid treating 
the neediest patients, because these are ordi- 
narily the least likely to recover. 

Not infrequently, metrics produce results almost 
the opposite of those for which they were intended. 
Muller gives the example of hospital 
emergency rooms that improved their metric 
for timely admission of patients by allowing a 
line of ambulances to form outside. 

Muller’s basic arguments, as he acknowl- 
edges, are not new. Thoroughgoing critiques 
of performance metrics began appearing in 
the 1970s in response to a wave of extrava- 
gant ambitions for numbers-based manage- 
ment. A particularly noteworthy target of 
such criticism was Robert McNamara, who 
became infamous for his fetishization of body 
counts during the Vietnam War. 

There were at least two funda- 
mental problems with this metric: 
first, that the numbers could be 
fudged—for example, by includ- 
ing noncombatants in the death 
counts—and second, that the 
number of deaths did not seem to 
contribute much to winning the 
war. We must add the moral point 
that this metric encouraged the 
slaughter of innocents. 

In 1975, the American social 
psychologist Donald Campbell 
and the British economist C. A. E. 
Goodhart articulated independently the prin- 
ciple that reliance on measurement to incen- 
tivize behaviors leads almost inevitably to a 
corruption of the measures. Muller explains 
the logic of this corruption and defends, in 
place of indiscriminate numbers, an ideal of 
professional knowledge and experience. 

Measurement, he concludes, can contrib- 
ute to better performance, but only if the 
measures are designed to function in alli- 
ance with professional values rather than as 
an alternative to them. Good metrics cannot 
be detached from customs and practices but 
must depend on a willingness to immerse 
oneself in the work of these institutions. 
Malaria in Venezuela 
requires response 


The Americas have witnessed a substantial 
decline in malaria-related morbidity (62%) 
and mortality (61%) during the past 15 
years as part of the implementation of the 
Global Malaria Action Plan (1). Venezuela, 
the first World Health Organization 
(WHO)-certified country to eradicate 
malaria in 1961 (2), has been the alarm- 
ing exception in the region, displaying an 
unprecedented 365% increase in malaria 
cases between 2000 and 2015 (1). In 2016
alone, 240,613 malaria cases were officially 
reported (3). Worryingly, 2017 witnessed 
an increase of 68% in the cumulative 
number of cases compared to the previ- 
ous year, totaling 319,765 malaria cases by 
21 October (4). The disease has spread to 
areas where malaria was eradicated previ- 
ously (such as near the capital, Caracas), 
prompting alarm in the health sector. 
Moreover, malaria cases have overloaded 
frontier health care infrastructure in Brazil 
and Colombia, where 78 and 81%, respec- 
tively, of imported malaria cases in 2016 
originated in Venezuela (5). 

Economic and political mismanage- 
ment have precipitated a general collapse 
of Venezuela’s health system, creating an 
ongoing humanitarian crisis with severe 
social consequences (6, 7). The malaria 
epidemic has been fueled by financial 
constraints for procurement of malaria 
commodities (such as insecticides, drugs, 
diagnostic supplies, and mosquito nets) 
and surveillance activities, internal human 
migration associated with illegal gold 
mining, and lack of provision and imple- 
mentation of services (2, 3). The continued 
upsurge of malaria in Venezuela threatens 
to become uncontrollable, and it is jeopar- 
dizing the hard-won gains in the Americas’ 
elimination agenda and global malaria tar- 
gets. There is an urgent need for support 
and action from WHO, United Nations 
agencies, the U.S. Agency for International 
Development, the Global Fund, and other 
international organizations to control this 
epidemic. Failure to address this danger- 
ous situation may result in one of the 
worst malaria epidemics in the history of 
the Americas. 
Venezuelans wait outside a health center for malaria treatment in November 2017.
Integrated approach
to malaria control 


To sustain reductions in child mortality 
due to malaria, J. Hemingway argues that 
new public health insecticides are required 
(“The way forward for vector control,” 
Perspectives, 24 November 2017, p. 998). 
However, because of the heavy selection 
pressure placed on mosquito populations, 
the use of new compounds will likely lead
to the development of new mechanisms to 
resist them. Integrated vector management 
programs should therefore devote equal 
attention to noninsecticide methods. 

Numerous biological control agents, 
from viruses to predatory fish, have been 
evaluated for effectiveness in controlling 
malaria mosquitoes. Foremost, the use of 
the bacterium Bacillus thuringiensis var. 
israelensis has substantially diminished 
malaria when applied to aquatic habitats, 
such as in urban areas of Tanzania (1) 
and highland villages in Kenya (2). Adult 
mosquitoes can also be controlled with 
entomopathogenic fungi (3). Research has 
focused on the development of innovative 
delivery platforms of fungal spores (4) as 
well as genetic modification of the fungus 
to increase effectiveness (5). 

People affected by malaria often reside 
in poorly constructed houses with oppor- 
tunities for mosquito entry. Improved 
housing and living conditions protect 
against malaria (6), as well as providing 
other health benefits, such as reducing 
exposure to Culex mosquitoes, vectors of 
the parasitic disease filariasis (7). 

Lastly, manipulating mosquito behavior 
through exploitation of volatile info- 
chemicals presents opportunities for 
pesticide-free vector control. For example, 
a large intervention trial using odor- 
baited mosquito traps removed up to 70% 
of the Anopheles funestus population, 
resulting in a 30% decrease in malaria (8). 

Integration of a variety of tools for 
malaria control is central to the Global 
Vector Control Response 2017-2030 
(9). The plan emphasizes integrated 
vector management as a pillar of sus- 
tainable malaria control, in addition 
to engagement of communities, investments
in research, and capacity building.
Pesticide resistance needs to be addressed
in this context, not by a narrow focus
on new pesticides alone. Only with an
integrated approach can we make malaria 
elimination a realistic target. 
Response


Koenraadt and Takken argue that, in addition
to finding new mosquito vector control
insecticides to replace those that are currently
failing due to resistance, we should devote
equal attention to alternative methods.
We agree with the principle of reducing 
selection pressure on new insecticides by 
combining them with other robust, cost- 
effective, operationally viable vector control 
activities. The challenge lies in finding 
properly evaluated interventions that can 
be recommended for use and deployed 
cost-effectively in a variety of transmis- 
sion settings. The alternative interventions 
cited by Koenraadt and Takken have never 
been successfully used operationally on a 
large scale. Although some interventions 
have been shown to be effective in small- 
scale studies, the vector control literature 
is awash with poorly designed interven- 
tion “trials” that are riddled with bias, 
lack robust controls, and have insufficient 
statistical power (1-3). Financial support
for malaria control is already well below the 
investment needed for interventions that 
work (1-3). Reducing coverage even further 
by diverting resources to unproven alterna- 
tive interventions is ethically unacceptable. 
In 2004, the Cochrane infectious disease 
group began commissioning a series of 
systematic reviews to assess the evidence 
for various interventions. One review 
concluded that electronic buzzer devices, 
sold directly to the public as mosquito 
repellents, do not work and should not be 
used to reduce mosquito bites or malaria 
infection rates (1). The 2013 review on larval
source management, which assessed all
published and unpublished data from 1900
to 2012, located only 13 studies of sufficient 
quality to include in the assessment. The 
authors concluded that larval source man- 
agement is a valid policy option in limited 
situations where sufficient breeding sites 
can be targeted. However, they pointed 
out that further research is still needed to 
evaluate larval source management feasi- 
bility in rural Africa (2). 

The review on larvivorous fish (2013) 
concluded that there were no reliable stud- 
ies demonstrating an impact on malaria 
infection rates in nearby communities and 
that all the studies reporting reductions in 
mosquito numbers had a high risk of bias. 
The evidence for use of fish was insufficient 
to recommend their use (3). The review 
on larvicides, which will include Bacillus 
thuringiensis var. israelensis, was commis- 
sioned in 2017, and the protocol for the study 
search has been published (4). It remains 
to be seen how many of the publications on 
larval interventions are sufficiently robust 
and lacking in bias to warrant inclusion. 

Other potential interventions, such as 
entomopathogenic fungi, are not yet at the 
point where the evidence can be sensibly 
evaluated. The fatal flaw in this interven- 
tion, which has yet to be overcome despite 
the efforts of several groups, is the inability 
of the spore formulation to survive for any 
significant periods of time at tropical storage 
temperatures, severely limiting its shelf life. 

A few interventions, such as odor-baited 
traps and house eave tubes, are entering 
large-scale trials, most with substantial 
funding from the Bill and Melinda Gates 
Foundation (5). These trials take several 
years and will need to be completed before 
they generate the evidence that may allow 
them to be recommended for wide-scale 
operational use. 

Reviews on long-lasting nets (2004) 
and indoor residual spraying (2010) supported
the use of these interventions (6, 7).
Therefore, limited resources should be
directed to these solutions. 
Janet Hemingway 
Avoiding polar catastrophe
on perovskite surfaces 
Setvin et al., p. 572 
GENETICS


Myoediting for Duchenne 
muscular dystrophy 


Duchenne muscular dystrophy (DMD)
is an X-linked recessive disorder that 
results in progressive weakening 
and muscle loss. Long et al. used 
CRISPR-Cas9 genome editing to 


NEAR-FIELD COSMOLOGY 
Dwarf galaxies move 
in unexpected ways 


Massive galaxies like our Milky
Way are orbited by satellite
dwarf galaxies. Standard
cosmological simulations of 
galaxy formation predict that 
these satellites should move 
randomly around their host. 
Muller et al. examined the 
satellites of the nearby ellipti- 
cal galaxy Centaurus A (see the 
Perspective by Boylan-Kolchin). 
They found that the satellites are 
distributed in a planar arrange- 
ment, and the members of the 
plane are orbiting in a coherent 
direction. This is inconsistent 
with more than 99% of com- 
parable galaxies in simulations. 
Centaurus A, the Milky Way, 
and Andromeda all have highly 
statistically unlikely satellite
systems. This observational evi- 
dence suggests that something 
is wrong with standard cosmo- 
logical simulations. —KTS 
Science, this issue p. 534; 
see also p. 520


EMERGING INFECTIONS 


Fearsome flaviviruses 
Although Zika virus was identified 
more than half a century ago, its 
impact on the unborn children
of infected pregnant women was
only detected in the recent dev- 
astating epidemic. Unfortunately, 
Zika virus may not be unique in its 
ability to inflict fetal damage. To 
investigate this possibility, Platt 
et al. infected pregnant immuno- 
competent mice with related 
viruses. Two other flaviviruses, 
West Nile virus and Powassan 
virus, caused fetal demise. These
viruses could also replicate in 
human maternal and fetal explant 
tissue. If these or other neuro- 
tropic flaviviruses take off in the 
human population, we may again 
experience congenital infections 
with devastating effects. —LP 

Sci. Transl. Med. 10, eaao7090 (2018).


West Nile virus, like its cousin Zika, 
may damage fetuses. 
correct the hotspot mutations that disrupt
expression of the dystrophin gene in
DMD. So-called myoediting abolished RNA 
splice sites and allowed the removal of the 
most common mutations. The approach 
successfully restored dystrophin 
expression in patient-derived induced 
pluripotent stem cells and restored 
mechanical force contraction in derived 
cardiomyocytes to near normal. —ASH 

Sci. Adv. 10.1126/sciadv.aap9004 (2018). 


A new technology brings hope
in the fight against DMD. 


TOPOLOGICAL MATTER 
A phonon
merry-go-round


Chirality is associated with the 
breaking of symmetry, often 
described as left- or right-handed 
behavior. Such asymmetry can 
be seen, for example, in the 
electronic responses of par- 
ticular materials or the reactions 
between particular chemical 
species. Zhu et al. observed
a chiral phonon mode in a
monolayer of the transition metal
dichalcogenide WSe₂, detected
spectroscopically as the circular 
dichroism of the phonon-assisted 
transition of holes. Phonon chiral- 
ity could be used to control the 
electron-phonon coupling and/or 
the phonon-driven topological 
states of solids. —ISO 

Science, this issue p. 579 
Remember the sugar
when making proteins 


Eukaryotes have an elaborate 
trafficking and quality-control sys- 
tem for secreted glycoproteins. 
The glycosylation pathway begins 
in the endoplasmic reticulum
with the enzyme oligosaccharyltransferase
(OST), which attaches a long chain of sugars
to asparagine residues of target
proteins. Wild et al. report a cryo-electron
microscopy structure of
yeast OST, which includes eight
separate membrane proteins. The
central catalytic subunit contains
binding sites for substrates and
is flanked by accessory subunits
that may facilitate delivery of 
newly translocated proteins for 
glycosylation. —MAF 


Science, this issue p. 545


Structure of the 
human spliceosome 


Catalyzed by the spliceosome, 
precursor mRNA splicing proceeds
in two steps: branching
and exon ligation. Transition from
the C (catalytic post-branching
spliceosome) to the C* (catalytic
pre-exon ligation spliceosome)
complex is driven by the adenosine
triphosphatase/helicase
Prp16. Zhan et al. report the
cryo-electron microscopy structure
of the human C complex,
showing that two step I splicing
factors stabilize the active site
and link it to Prp16. —SYM 


Science, this issue p. 537 


A demanding lifestyle
Polar bears appear to be well 
adapted to the extreme condi- 
tions of their Arctic habitat. 
Pagano et al., however, show that 
the energy balance in this harsh 
environment is narrower than we 
might expect (see the Perspective 
by Whiteman). They monitored 
the behavior and metabolic rates 
of nine free-ranging polar bears 
over 2 years. They found that high 
energy demands required con- 
sumption of high-fat prey, such as 
seals, which are easy to come by 
on sea ice but nearly unavailable
in ice-free conditions. Thus, as sea 
ice becomes increasingly short- 
lived annually, polar bears are 
likely to experience increasingly 
stressful conditions and higher 
mortality rates. —SNV 

Science, this issue p. 568;
see also p. 514


Biofilms provide refuge 
for cancerous bacteria 


Familial adenomatous polyposis 
(FAP) causes benign polyps 
along the colon. If left untreated, 
FAP leads to a high incidence of 
colon cancer. To understand how 
polyps influence tumor forma- 
tion, Dejea et al. examined the 
colonic mucosa of FAP patients. 
They discovered biofilms contain- 
ing the carcinogenic versions of 
the bacterial species Escherichia 
coli and Bacteroides fragilis. 
Colon tissue from FAP patients 
exhibited greater expression of 
two bacterial genes that produce 
secreted oncotoxins. Studies in 
mice showed that specific bacte- 
ria could work together to induce 
colon inflammation and tumor 
formation. -PNK 

Science, this issue p. 592 


Self-defense by
avoiding self-targeting

By silencing transposons, 
Piwi-interacting RNAs (piRNAs) 
protect the stability of animal 
genomes in germ lines. However, 
many piRNAs do not map to 
transposons, and their func- 
tions have remained undefined. 
Zhang et al. described the piRNA 
targeting logic in Caenorhabditis 
elegans and identified an intrinsic 
sequence signal in endogenous
germline genes that confers resistance
to piRNA silencing. Thus,
diverse piRNAs silence foreign 
nucleic acids but spare self genes 
to defend the C. elegans genome. 
In addition, multiple foreign 
transgenes can be engineered to 
escape piRNA targeting, allowing 
successful expression in the 
germline. —SYM 


Science, this issue p. 587 
Confocal micrograph
of Caenorhabditis 
elegans worms 


CELL BIOLOGY 
Targeting the demise of 
male mitochondria 


In the nematode worm Caenorhabditis elegans, as in
humans, offspring inherit mitochondria and mitochon- 
drial DNA only from their mother. Sato et al. identified key 
molecular components of the machinery that recognizes 
male mitochondria and targets them for degradation. In a
screen for kinases that might be required for the process, they 
identified IKKE-1, a protein kinase with similarity to the mam- 
malian protein kinases TBK1 and IKBKE. IKKE-1 interacted 
with a protein that they named ALLO-1, which had sequence 
similarity to autophagy receptors and was required for clear- 
ance of paternal organelles. The similarity of the role of IKKE-1 
in removing male mitochondria to that of mammalian TBK1, 
which functions in the innate immune response, hints that the 
two processes might have a similar evolutionary origin. —LBR 
Nat. Cell Biol. 10.1038/s41556-017-0008-9 (2018). 


High caloric intake
induces inflammation

The consumption of high-calorie
diets has become prevalent
in industrialized nations and
is associated with increased
body mass, inflammation, and
resulting pathologies. What are
the underlying mechanisms
connecting diets of high-energy,
processed food to inflammation?
Christ et al. show that
mice fed on a high-calorie
Western diet developed systemic
inflammation and functionally
reprogrammed granulocyte
monocyte precursor cells.
After a standard chow diet was
restored to the animals, inflammation
persisted, indicating that
OCEAN PHYSICS


Shear by the centimeter 


Predicting the transport of oil and
floating debris in the ocean is very 
difficult because it is challenging 
to measure differences in water 
motion very close to the surface. 
Laxague et al. present measurements 
of the ocean's current vector profile 
made in the Gulf of Mexico, showing 
that the velocity of the uppermost 
centimeter of the water column was 
four times that at a depth of 10 m and 
displaced by nearly 90°. Therefore, 
incorporating these kinds of dynamics 
into forecasting efforts is essential for 
better understanding the movement 
and fates of plastic particles and spilled 


components in the Western diet 
induced innate immune memory. 
This study identifies possible tar- 
gets for preventing diet-induced 
inflammation. —GKA 

Cell 172, 162 (2018). 


COMPARATIVE GENOMICS 
Using phylogeny to 
test evolution 


Sequencing of whole genomes 
and obtaining information about 
gene transcription offer the 
promise to compare changes 
and variation in gene expression 
and function across species. 
However, Dunn et al. show that 
analyses of gene expression data 
that use pairwise comparisons 
across multiple species may not 
be able to distinguish between 
differing models of evolution. 

By examining published studies 
of how gene expression may 
change across species, they 
show that it is necessary to 
consider the mode of evolution— 
i.e., duplication generating 
paralogs or orthologous evolu- 
tion through speciation—as well 
as the time since the species 
diverged. Thus, phylogenetic 
information is needed to discern 
commonalities and differences
across species and among genes 
that share an ancestor but may 
have undergone different evolu- 
tionary trajectories. —LMZ 
Proc. Natl. Acad. Sci. U.S.A. 10.1073/ 
pnas.1707515115 (2018). 


STAR FORMATION 
Many stars don’t form
in clusters

Most stars are thought to form 
in dense, gravitationally bound 
molecular clouds, which should 
produce a bound cluster of 
stars. As unused gas is expelled 
from the system by stellar 
feedback, the cluster becomes 
gravitationally unbound to form 
an association, which gradu- 
ally drifts apart. Associations of 
young stars (no more than a few 
million years old) should there- 
fore be slowly expanding. Ward 
and Kruijssen use astrometric 
data to test this idea by looking 
for evidence of expansion in 
several nearby young associa- 
tions. They do not find any; the 
associations show no sign of 
ever having been gravitationally 
bound. The authors suggest that 
this indicates that star formation 
is dominated by turbulent
fragmentation, not monolithic 
collapse. —KTS 
Mon. Not. R. Astron. Soc. 10.1093/ 
mnras/sty117 (2018). 


ANTIBIOTICS 

Recycling antibiotic 
sensitivity 

Cystic fibrosis (CF) is accompa- 
nied by chronic lung infections, 
requiring lifelong consumption 
of many antibiotics to maintain 
airway function. Antibiotic resis- 
tance and clinical deterioration 
are apparently inevitable. In a
multifactorial study, Imamovic
et al. discovered that mutations
in Pseudomonas aeruginosa
that endow resistance to some
classes of antibiotic concomitantly
result in sensitivity to
others. For example, in any strain
of P. aeruginosa, resistance to
the fluoroquinolone ciprofloxacin
was consistently associated with
sensitivity to aminoglycoside
antibiotics, such as amikacin.
In this case, resistance was
mediated by mutations in the
transcriptional regulator nfxB
of the MexC transporter. Using
biomarkers such as nfxB, it 
oil in the oceans. —HJS


Geophys. Res. Lett. 10.1002/ 
2017GL075891 (2017). 


Oil from the 2011 MV Rena spill 
surrounds a service vessel. 


appears to be possible to design 
antibiotic regimens for individual 
CF sufferers that flip resistant 
bacteria between predictable 
drug-sensitive states. —CA 

Cell 10.1016/j.cell.2017.12.012 (2018). 


MICROBIOLOGY 


Cutting up in the deep 
Life in the deep ocean is cut 
off from sunlight as an energy 
source. Fixed carbon from the 
surface—namely, dead microbes 
and larger organisms—therefore 
serves as a source of energy 
at depths below the reach of 
sunlight. Bergauer et al. analyzed 
genomic and proteomic data 
from microorganisms collected 
at depths down to 4 to 5 km. 
Microbial transport proteins 
varied only slightly in abundance 
with depth, reflecting changes in 
the abundance of organisms and 
in lifestyle. Organic acid trans- 
porters were found throughout 
the water column and were more 
abundant in the deep ocean, 
suggesting that uptake of dilute 
dissolved organic matter is 
crucial for microbial metabolism 
at these depths. —MAF 

Proc. Natl. Acad. Sci. U.S.A. 10.1073/
pnas.1708779115 (2018).
EARTH SYSTEMS
Integrating the biosphere
into climate models
High-quality climate predictions 
are crucial for understanding the 
impacts of different green- 
house gas emission scenarios 
and for mitigating and adapt- 
ing to the resulting climatic 
changes. Bonan and Doney 
review advances in Earth system 
models that include the ter- 
restrial and marine biosphere. 
Such models capture interac- 
tions between physical and 
biological aspects of the Earth 
system. This provides insight 
into climate impacts of societal 
importance, such as altered crop 
yields, wildfire risk, and water 
availability. Further research is 
needed to better understand 
model uncertainties, some of 
which may be unavoidable, and 
to better translate observations 
into abstract model representa- 
tions. —JFU 


Science, this issue p. 533 


NEURODEVELOPMENT 
Brain mutations, 
young and old 


Most neurons that make up the 
human brain are postmitotic, 
living and functioning for a very 
long time without renewal (see 
the Perspective by Lee). Bae
et al. examined the genomes
of single neurons from the
prenatal developing human 
brain. Both the type of mutation 
and the rates of accumulation 
changed between gastrulation 
and neurogenesis. These early 
mutations could be generat- 
ing useful neuronal diversity or 
could predispose individuals to 
later dysfunction. Lodato et al. 
also found that neurons take on 
somatic mutations as they age 
by sequencing single neurons 
from subjects aged 4 months 
to 82 years. Somatic mutations 
accumulated with increasing 
age and accumulated faster in 
individuals affected by inborn 
errors in DNA repair. Postmitotic 
mutations might only affect one
neuron, but the accumulated 
divergence of genomes across 
the brain could affect function. 
—PJH 
Science, this issue p. 550, p. 555; 
see also p. 521


SURFACE SCIENCE 
Compensating 
a polar surface 


An ionic crystal surface can be
electrostatically unstable, and 
the surface must reconstruct in 
some way to avoid this “polar 
catastrophe.” Setvin et al. used 
scanning probe microscopies 
and density functional theory
to study the changes in the
polar surface of the perovskite
KTaO₃. They observed several
structural reconstructions as the 
surface cleaved in vacuum was 
heated to higher temperatures. 
These ranged from surface 
distortions to the formation of 
oxygen vacancies to the development
of KO and TaO₂ stripes.
Hydroxylation after exposure to 
water vapor also stabilized the 
surface. —PDS 


Science, this issue p. 572 


CARBON FIXATION 
About-face for
citrate synthase


Classically, it is thought that 
citrate synthase only works in 
one direction: to catalyze the 
production of citrate from acetyl
coenzyme A and oxaloacetate
in the tricarboxylic acid (TCA)
cycle. The TCA cycle can run in 
reverse to cleave citrate and fix 
carbon dioxide autotrophically, 
but this was thought to occur 
only with alternative enzymes, 
such as citrate lyase. Now 
Nunoura et al. and Mall et al. 
have discovered thermophilic 
bacteria with highly efficient and 
reversible citrate synthase that 
requires reduced ferredoxin (see 
the Perspective by Ragsdale). 
This function is undetectable
by metagenomics, but classical
biochemistry filled in the gaps
seen between the genome 
sequences and the phenotypes 
of the organisms. The direction 
of catalysis depends on the 
availability of organic versus 
inorganic carbon and reflects a 
flexible bet-hedging strategy for 
survival in fluctuating environ- 
ments. In evolutionary terms, 
this capacity might predate 
the classical TCA cycle and is 
likely to occur in a wide range of 
anaerobic microorganisms. —CA 
Science, this issue p. 559, p. 563; 
see also p. 517 


CANCER IMMUNOTHERAPY 
HLA genotype 
affects response 


Immunotherapy works by 
activating the patient's own 
immune system to fight cancer. 
For effective tumor killing, 
CD8+ T cells recognize tumor
peptides presented by human
leukocyte antigen class I (HLA-I)
molecules. In humans, there are
three major HLA-I genes (HLA-A,
HLA-B, and HLA-C). Chowell et
al. asked whether germline HLA-I
genotype influences how T cells
recognize tumor peptides and
respond to checkpoint inhibitor
immunotherapies (see the
Perspective by Kvistborg and
Yewdell). They examined more
than 1500 patients and found
that heterozygosity at HLA-I
loci was associated with better
survival than homozygosity for
one or more HLA-I genes. Thus,
specific HLA-I mutations could
have implications for immune 
recognition and for the design of 
epitopes for cancer vaccines and 
immunotherapies. —PNK 
Science, this issue p. 582; 
see also p. 516 


AUTOIMMUNE DISEASE 
The X chromosome 
link to lupus 


Nine out of 10 people who 
develop systemic lupus erythematosus
(SLE) are women.
Furthermore, individuals with
Klinefelter syndrome (47,XXY) 
also have increased incidence 
of SLE. This suggests that X 
chromosome dosage could be 
an important risk factor in SLE. 
Using sensitive quantification 
methods, Souyris et al. demon- 
strate that Toll-like receptor 7 
(TLR7), which is encoded on the 
X chromosome, escapes X inacti- 
vation in B cells and myeloid 
cells in females and Klinefelter 
individuals. TLR7 binds single- 
stranded RNA and activates 
type I interferon signaling, a
pathway that is also activated 
in SLE patients. Thus, biallelic 
expression of TLR7 appears to 
contribute to greater SLE risk. 
—AB 

Sci. Immunol. 3, eaap8855 (2018). 


CANCER 


The cytoskeleton-EGFR 
connection 


Some cancers are caused by 
aberrant epidermal growth fac- 
tor receptor (EGFR) signaling. 
Two papers now reveal connec- 
tions between this receptor and 
cytoskeletal remodeling (see the 
Focus by Chiasson-MacKenzie 
and McClatchey). Pike et al. 
delineated a cytoskeletal 
pathway that delayed receptor 
internalization, thereby increas- 
ing EGFR signal duration from 
the membrane. Roth et al. identi- 
fied a cytoskeleton-associated 
effector of EGFR signaling that 
promoted migration and prolif- 
eration of mammary epithelial 
cells and was associated with 
poor prognosis in breast cancer 
patients. —LKF 

Sci. Signal. 10, eaan0949, eaaq1060;
see also eaas9473 (2018).


SUPERCONDUCTIVITY 
A nonlinear peek
into stripes

In many theoretical models
of high-temperature superconductors,
remnants of superconductivity persist to
temperatures higher than the
transition temperature, Tc.
Rajasekaran et al. used nonlinear 
terahertz spectroscopy to probe 
this region of the phase diagram 
of a cuprate superconductor that 
is well known for a stripe phase 
that appears for certain doping
levels (see the Perspective
by Ergecen and Gedik). For a 
sample deep in the stripe phase, 
a large nonlinear signal persisted 
from the superconducting region 
up to temperatures much higher 
than Tc. The findings suggest the
formation of a peculiar spatially 
modulated superconducting 
state called the pair-density 
wave. —JS 

Science, this issue p. 575;
see also p. 519
Climate, ecosystems, and planetary
futures: The challenge to predict life 
in Earth system models 


Gordon B. Bonan* and Scott C. Doney* 


BACKGROUND: Earth system models (ESMs) 
simulate physical, chemical, and biological pro- 
cesses that underlie climate and are the most 
complex in a hierarchy of models of Earth’s 
interacting atmosphere-land-ocean-sea ice sys- 
tem. As terrestrial and marine ecosystems have 
been added to ESMs, the distinction between 
the physical basis for climate change, mitigation, 
and vulnerability, impacts, and adaptation (VIA) 
no longer necessarily holds. The same global 
change stresses that affect terrestrial and ma- 
rine ecosystems are critical processes that deter- 
mine the magnitude and trajectory of climate 
change, and many of the interventions that might 
lessen anthropogenic climate change pertain 
to the biosphere. Here we describe environ- 
mental changes that are stressing terrestrial 
and marine ecosystems. We discuss how these 
stressors are being included in ESMs, initially 
with an emphasis on climate processes, but 
also show their emerging utility for VIA an- 
alyses and examine them in the context of 
Earth system prediction. 


The various models used for climate projections
and mitigation and VIA analyses overlap in scope
and would benefit from a broad perspective of
Earth system prediction. Shown are the domains of
ESMs, mitigation models, and VIA models along axes
from VIA to climate processes (horizontal) and from
primarily serving the research community to informing
societal needs (vertical). Panels show forests and
agriculture (left) and marine ecosystems (right)
as represented across modeling domains.

[Figure labels, land panel (forest and agriculture): carbon; tree mortality; habitat loss; forest fires; crop yield; water use; timber and forest products; ecology, hydrology, and economic models.]

Bonan et al., Science 359, 533 (2018)

2 February 2018


ADVANCES: Terrestrial ecosystems face stresses 
from changing climate and atmospheric com- 
position that alter phenology, growing season 
length, and community composition; these 
stresses enhance productivity and water-use 
efficiency in some regions, but also lead to 
mortality and increased disturbances from wild- 
fires, insects, and extreme events in other re- 
gions. The addition of reactive nitrogen, elevated 
levels of tropospheric O3, and anthropogenic 
land-use and land-cover change stress ecosys- 
tems as well. The terrestrial biosphere models 
included in ESMs simulate the ecological im- 
pacts of these stresses and their effects on Earth 
system functioning. Ocean ecosystems and living 
marine resources face threats from ocean warm- 
ing, changing large-scale circulation, increased 
vertical stratification, declining oxygen, and 
acidification, which alter nutrient supply, the 
light environment, and phytoplankton produc- 
tivity; result in coral bleaching; and produce 
novel marine communities. Three-dimensional
ocean models simulate the carbon cycle and
associated biogeochemistry. Plankton ecosystem
models both drive biogeochemistry models and
characterize marine ecological dynamics.

[Figure labels, marine panel: Research; Ecology & water quality
models; Fisheries; Coral bleaching; Low O2-hypoxia; Biomass burning;
wetlands]


OUTLOOK: The untapped potential of ESMs 
is to bring dispersed terrestrial and marine eco- 
system research related to climate processes, 
VIA, and mitigation into a common framework. 
ESMs offer an opportunity to move beyond physical descriptors of
atmospheric and oceanic states to societally relevant quantities
such as habitat loss, water availability, wildfire risk, air
quality, and crop, fishery, and timber yields. To do so, the
science of climate prediction has to be extended to a more
multifaceted Earth system prediction, including the biosphere and
its resources. ESMs provide the means not just to assess the
potential for future global change stresses, but also to determine
the outcome of those stresses on the biosphere. Such Earth system
prediction is necessary to inform sound policy that maintains a
healthy biosphere and provides the food, energy, and fresh water
needed for a growing global population without further
exacerbating climate change. Substantial impediments that must be
overcome include advancing our knowledge of biosphere-related
climate processes; reducing model uncertainty; and effectively
communicating among, rather than across, the disparate science
communities of climate prediction, global biosphere modeling, VIA
analyses, and climate change mitigation.


Read the full article at http://dx.doi.org/10.1126/science.aam8328


The list of author affiliations is available in the full article online. 
*Corresponding author. Email: bonan@ucar.edu (G.B.B.);
sdoney@virginia.edu (S.C.D.) 
EARTH SYSTEMS


Climate, ecosystems, and planetary 
futures: The challenge to predict life 
in Earth system models 


Gordon B. Bonan¹* and Scott C. Doney²*


Many global change stresses on terrestrial and marine ecosystems affect not only 
ecosystem services that are essential to humankind, but also the trajectory of future 
climate by altering energy and mass exchanges with the atmosphere. Earth system models, 
which simulate terrestrial and marine ecosystems and biogeochemical cycles, offer a 
common framework for ecological research related to climate processes; analyses of 
vulnerability, impacts, and adaptation; and climate change mitigation. They provide an 
opportunity to move beyond physical descriptors of atmospheric and oceanic states to 
societally relevant quantities such as wildfire risk, habitat loss, water availability, and crop, 
fishery, and timber yields. To achieve this, the science of climate prediction must be 
extended to a more multifaceted Earth system prediction that includes the biosphere and
its resources.


Human activities are transforming Earth’s
atmosphere, ocean, and land surfaces at a
scale and magnitude not previously seen
during the past several thousand years of
human history. These changes threaten
healthy planetary functions and socioeconomic
well-being (1, 2). Fossil fuel combustion, industrialized
agriculture, urbanization, and other facets
of modern human societies are changing climate 
and atmospheric composition; melting permafrost, 
glaciers, ice sheets, and Arctic sea ice; raising sea 
levels; warming and acidifying the oceans; pollut- 
ing air, water, and soils; altering biogeochemical 
cycles and freshwater availability; increasing the 
cycling of reactive nitrogen; reducing forest cover 
and degrading land; and destroying habitats and 
reducing biodiversity (3-5). The ecological con- 
sequences of these changes are apparent in in- 
dividual organisms, the communities they inhabit, 
and the ecosystems in which they function (6-8). 
The interconnectedness and global scope of 
this changing environment have transformed 
the scientific study of Earth as a system. It is now 
understood that climate change must be studied 
in terms of a myriad of interrelated physical, 
chemical, biological, and socioeconomic pro- 
cesses. This broadening basis for climate change 
research underlies the transformation from global 
climate models to Earth system models (ESMs). 
These models have shown that the biosphere not 
only responds to climate change, but also directly 
influences the direction and magnitude of cli- 
mate change. Terrestrial and marine ecosystems, 
and their uses by humans, are fundamental to
addressing the climate change problem. How
do we provide the food, energy, and fresh water
needed for a growing global population without
further exacerbating climate change? Can terrestrial
and marine ecosystems be managed to reduce
greenhouse gas emissions? With the advent of
ESMs, climate science is no longer limited to the
physical basis for climate projections, but also
includes projections of the biosphere, for example,
regarding carbon storage on land and in the
ocean, forest dieback, wildfires, crop yield, and
fisheries and marine resources.

¹National Center for Atmospheric Research (NCAR), Boulder,
CO 80307, USA. ²Department of Environmental Sciences,

However, the study of climate change is still 
often parsed into separate activities of observing 
changes and deducing causes (3); assessing the 
vulnerability, impacts, and adaptation (VIA) of 
natural and human systems to these changes 
(6, 7); and determining the socioeconomic trans- 
formations needed to mitigate them (9). The 
untapped potential of ESMs is to bring these 
dispersed activities into a common framework. 
There has been success, for example, in coordi- 
nating climate projections with the integrated 
assessment models that identify the societal transformations
needed to mitigate climate change (10)
and even some initial attempts at directly coupling
ESMs and integrated assessment models (11). As
terrestrial and marine ecosystems have been added 
to ESMs, the distinction between the physical basis 
for climate change, VIA, and mitigation no longer 
necessarily holds. Land-use and land-cover change, 
for example, is driven by socioeconomic needs for 
food, fiber, and fuel, but is also an ecological prob- 
lem that alters habitat and biodiversity and a means 
to mitigate anthropogenic CO2 emissions (4).

In this Review, we discuss the treatment of the 
biosphere in ESMs, considering terrestrial and 
marine ecosystems as they are now represented 
in the models, exploring how ESMs can be used 
to study the biosphere, and highlighting opportunities
for future research. We then describe
environmental changes that are occurring glob- 
ally and that are stressing terrestrial and marine 
ecosystems and show how these stresses are in- 
cluded in ESMs, in the past primarily with an 
emphasis on climate processes, but now with ad- 
ditional utility for VIA and mitigation research. 
Last, we examine these stresses in the context of 
Earth system prediction. Our list of stressors is 
not meant to be exhaustive. Rather, we highlight 
several key stressors and their coincidence among 
climate processes, VIA, and mitigation with the goal 
of initiating a dialog among the scientific commun- 
ities that study climate change. This Review is 
timely because it identifies synergies across the 
climate and ESM research communities involved 
in the next Coupled Model Intercomparison Project 
(CMIP6) (12), which provides an unparalleled opportunity
to model and analyze the Earth system.


Earth system models 


ESMs simulate physical, chemical, and biological 
processes that underlie climate. They are the 
most complex in the ongoing evolution of global 
models of Earth’s atmosphere, ocean, cryosphere, 
and land (Fig. 1). Climate models focus on the 
physical climate system, as represented by atmo- 
sphere, ocean, and sea ice physics and dynamics 
and land surface hydrometeorology. In climate 
models, land and ocean are coupled with the 
atmosphere through energy and momentum fluxes 
and the hydrologic cycle. ESMs have the same 
representation of the physical climate system but 
additionally include the carbon cycle, terrestrial 
and marine ecosystems and biogeochemistry, at- 
mospheric chemistry, and natural and human 
disturbances. ESMs typically couple distinct com- 
ponent modules for land, atmosphere, and ocean 
physics, and ecosystem dynamics and biogeo- 
chemistry are embedded into these modules. 

A prominent feature of ESMs is their inclusion 
of the biosphere and abiotic interactions that 
together make up an ecosystem. On land, terres- 
trial ecosystems are represented in ESMs by the 
type of vegetation, the amount of leaf area, the 
stomata on leaves, and carbon and nitrogen pools 
(13). Similarly, ESMs simulate ocean phytoplankton 
production of chlorophyll that influences the 
vertical profile of light absorption in the upper 
ocean, which in turn affects model sea surface 
temperature and mixed layer dynamics, as well 
as large-scale ocean circulation, heat transport, 
and climate variability (14, 15).

Biogeochemical cycles were added to ESMs 
because of the potential for large climate feed- 
backs arising from the carbon cycle. Terrestrial 
ecosystems and the ocean together absorb about 
one-half of the annual anthropogenic CO2 emissions
(16), but the future efficacy of these sinks
is uncertain (17). Biogeochemical processes on land
encompass spatial scales from leaves to plant cano- 
pies and from ecosystems to landscapes to biomes 
(13). Temporal scales include near-instantaneous 
physiological responses (e.g., stomatal conduct- 
ance, photosynthesis, and respiration) to prevailing 
environmental conditions; the seasonal emergence 
and senescence of leaves; and changes in ecosystem
structure and biogeography over decades
and centuries in response to natural disturbances
(e.g., wildfires), anthropogenic disturbances (e.g.,
land-use transitions), and climate change. Ongoing
model development aims to more authentically
represent plant demography and life history characteristics
using cohorts of individuals of similar
functional traits in vertically structured plant
canopies (18).

The three-dimensional carbon cycle models
used to estimate ocean uptake of anthropogenic
CO2 evolved from model tracer studies of ocean
physical circulation. Biogeochemical models additionally
track natural cycling of inorganic carbon,
alkalinity, macronutrients (nitrogen, phosphorus,
and silicon), and often O2; net organic matter
and CaCO3 production and export from the surface
ocean; particle sinking and respiration and
remineralization at depth; and air-sea CO2 (and O2)
gas exchange (19). Plankton ecosystem models
that simulate interactions of phytoplankton, zooplankton,
nutrients, and detrital pools arose both
to drive biogeochemistry models and to characterize
marine ecological dynamics, such as seasonal
phytoplankton blooms. Recent biogeochemical
developments include incorporation of iron and
other trace elements (20), iron limitation being a
major controlling factor for phytoplankton growth
in much of the ocean; more sophisticated treatment
of marine biological nitrogen fixation and
denitrification (21); coastal inputs of nutrients;
and ocean acidification. Major model advances
under way involve expansion of plankton biological
complexity to incorporate functional groups, trait-based
dynamics, and biodiversity (22-24) and
efforts to integrate simulated plankton productivity
with fisheries catch (25).

Fig. 1. Representation of the biosphere in Earth system models (ESMs). The top
panel shows land and ocean as included in climate models, and the bottom panel shows
the additional processes included in ESMs. ESMs simulate atmospheric CO2 in response
to fossil fuel emissions and terrestrial and marine biogeochemistry. Some ESMs also
simulate atmospheric chemistry, aerosols, and CH4. Terrestrial processes shown on the
left side of the diagram include biogeophysical fluxes of energy, water, and momentum;
biogeochemical fluxes; the hydrologic cycle; and land-use and land-cover change (13).
The carbon cycle includes component processes of gross primary production (GPP),
autotrophic respiration (Ra), litterfall, heterotrophic respiration (Rh), and wildfire. Carbon
accumulates in plant and soil pools. Additional biogeochemical fluxes include dust
entrainment, wildfire chemical emissions, biogenic volatile organic compounds (BVOCs),
the reactive nitrogen cycle (Nr), and CH4 emissions from wetlands. Ocean processes are shown on the right side of the diagram. Physical processes
include sea ice dynamics, ocean mixing and circulation, changes in sea surface temperature (SST), and ocean-atmosphere fluxes. The gray shaded
area depicts the marine carbon cycle, consisting of the phytoplankton-based food web in the upper ocean, export and remineralization in the deep sea
and sediments, and the physiochemical solubility pump controlled by surface-deep ocean exchange (100).
[Figure labels: Surface ocean; Deep consumers]

An active research frontier for ESMs is incor- 
porating more extensive chemistry-climate inter- 
actions. Additional reactive nitrogen affects climate 
through enhanced terrestrial carbon storage, emissions
of N2O, and chemical reactions that determine
the amount of tropospheric O3, CH4, and
aerosols (5, 26). Atmospheric deposition of nitro- 
gen to the surface ocean can enhance biological 
productivity in low-nutrient subtropical regions; 
globally, however, marine biogeochemistry may 
be more sensitive to anthropogenic iron deposition
(27, 28). Increased concentrations of tropospheric
O3 decrease plant productivity and reduce
the terrestrial carbon sink, but the appropriate 
way to parameterize this in models is uncertain 
(29, 30). Emissions of biogenic volatile organic 
compounds from terrestrial ecosystems influence 
atmospheric concentrations of O3, CH4, and secondary
organic aerosols (31). Wetlands are an important
source of CH4, as are permafrost soils and
hydrates. Global models of wetlands and CH4
emissions are being developed (32), and some 
ESMs include methane chemistry in their cli- 
mate projections (33). In the ocean, more di- 
verse biogeochemistry is needed (e.g., trace gases 
such as dimethyl sulfide) in models to link ocean- 
atmosphere chemistry. 

Natural and human disturbances are a con- 
tinuing research priority for ESMs. Wildfires affect 
climate and air quality through emissions of long- 
lived greenhouse gases, short-lived reactive gases, 
and aerosols and by altering surface albedo (34). 
Wildfires are included in ESMs, but our ability to 
model the precise details of fire regimes is limited 
(35). The mountain pine beetle epidemic in west- 
ern North American forests has reduced terres- 
trial carbon uptake (36), increased surface albedo 
(37), and warmed the surface by reducing evapo- 
transpiration (38). Efforts to represent insect out- 
breaks in ESMs are promising but still nascent 
(39). Human disturbances include land-use tran- 
sitions (e.g., deforestation, reforestation, farm 
abandonment, and shifting cultivation) and wood 
harvest (40). Global models of crop growth are 
included in ESMs, but specific cultivars, time of 
planting, crop rotation, and other management 
practices are lacking (41, 42). Nor is forest man- 
agement and commercial timber production in- 
cluded, despite having an effect on temperature 
similar to that of land-cover change (43). 


Planetary stresses and climate feedbacks

The inclusion of terrestrial and marine ecosys- 
tems in ESMs enables study of the global change 
stresses on the biosphere and feedbacks with cli- 
mate change (Table 1). A prominent signal on land 
is a “greening” of the biosphere, although this is 
partially countered by increased tree mortality 
and disturbance. Novel community assemblages 
are likely to emerge that depend on the magni- 
tude and rate of climate change and the ability of 
species to adjust to these changes through dis- 
persal. Marine ecosystems also face numerous 
threats from climate change (44-46). Surface ocean 
waters are warming globally and freshening at 
high latitudes; together, these trends act to in- 
crease vertical physical stratification, resulting in 
altered regional patterns of nutrient supply, light 
environment, and phytoplankton productivity. 
Ocean warming leads to shifts in plankton sea- 
sonal phenology and poleward migration of plank- 
ton, invertebrate, and fish species; coral bleaching; 
and sea ice loss in polar marine ecosystems. Cli- 
mate change is projected to alter the spatial pat- 
terns and size of marine wild-caught fisheries (47) 
and may potentially change marine disease out- 
breaks (48). Marine communities and ecosystems 
also may be reorganizing into novel assemblages, 
requiring more sophisticated ESMs that can track 
in more detail the effects of warming on plankton 
community composition and trophic interactions (49). 

Human activities imperil ecosystems and biota 
in ways other than direct climate change (Table 1). 
Additional reactive nitrogen alters biodiversity; 
terrestrial, freshwater, and marine biogeochemistry;
and water and air quality. Anthropogenic aerosols
increase the amount of diffuse solar radiation, 
which can enhance terrestrial productivity. High 
concentrations of tropospheric O3 can cause stomata
to close and thus decrease plant productivity and 
transpiration. Vast areas of forests have been cleared 
over the industrial era, and many of the remain- 
ing forests are managed or secondary rather than 
old-growth primary forests. About one-third of the 
ice-free land is covered by cropland or pastureland, 
and much of the terrestrial productivity has been 
appropriated for human uses. 

Marine impacts arise from ocean acidification
owing to increasing atmospheric CO2 and
deoxygenation from climate-related circulation 
changes. Regional stresses, particularly on conti- 
nental shelves and some parts of the open ocean, 
are occurring from overfishing, anthropogenic 
noise, seabed habitat destruction, pollution, and 
coastal eutrophication, as well as loss of coastal 
wetlands, mangrove forests, and seagrass beds 
owing to development and sea level rise (50). 


Vulnerability, impacts, and adaptation 


Agriculture and food security, forest and water 
resources, terrestrial ecosystems, and fisheries and 
marine ecosystems are facets of the biosphere that 
sustain socioeconomic well-being. Assessing the 
impact of climate change on these goods and 
services, their vulnerability to disruption in a chang- 
ing climate, and the adaptations needed to main- 
tain their future availability is critical for informing 
sound climate policies (6, 7). Such assessments are 
commonly obtained by using climate projections
to drive models of terrestrial ecosystems (51), crop
yield (52), water availability (53), and fisheries and
marine ecosystems (54). This indirect, two-step
approach has deficiencies because archived climate
model output may not capture variable types
or temporal resolution needed for some VIA models
and restricts the ability to study feedbacks on
climate and biogeochemistry.


Table 1. Planetary stresses faced by terrestrial and marine ecosystems.

Stress (terrestrial ecosystems): Earlier springtime and longer
growing season; Greater productivity; Diffuse radiation; Tree
mortality; Wildfire and insect outbreaks; Community assemblages

Stress (marine ecosystems): Aerosol deposition

With inclusion of the biosphere in ESMs, VIA 
can be investigated directly. For example, the ESMs 
used to quantify future carbon-climate feedbacks 
can also be used in retrospective studies to assess 
responses of terrestrial and marine ecosystems 
to historical anthropogenic forcings and climate 
variability (16, 55, 56). The ocean component of 
ESMs provides a tool for reconstructing the var- 
jiability, trends, and mechanisms of historical ocean 
biogeochemistry (57) and plankton dynamics (58). 
ESMs with high-resolution ocean circulation models 
can be used to track larval dispersal and connec- 
tivity among coral reefs subject to bleaching (59). 

ESMs provide further opportunity to move 
beyond physical descriptors of atmospheric and 
oceanic states (such as temperature and precip- 
itation) to societally relevant quantities related 
to food, energy, and fresh water. For example, the 
croplands in ESMs allow direct study of the im- 
pacts of climate change on agricultural produc- 
tion, the vulnerability of the food supply to future 
climate disruption, and adaptations to make food 
production more resilient (42). Some ESMs in- 
clude urban land cover, which allows study of 
extreme heat waves and facilitates assessment 
of heat-stress mortality in cities (60). ESMs are 
particularly useful for assessing air quality and 
human health issues because of the interactions 
among agriculture, wildfire, nitrogen gaseous fluxes, 
and biogenic volatile organic compounds that af- 
fect regional air quality. 

Achieving this potential requires effective com- 
munication between the scientists developing ESMs 
and those using climate projections to study VIA 
(61). One result of better collaboration would be 
to identify and reconcile discrepancies between 
ESMs and VIA models, such as are evident in 
their assessments of water availability in a future 
climate. ESMs account for the effects of elevated 
atmospheric CO2 on stomatal conductance and
evapotranspiration, but many VIA models do not, 
resulting in an inconsistency in projections of 
water availability (62). Other examples of processes 
included in ESMs but not VIA models are the ef- 
fect of O3 on stomata (29, 30) and the effect of 
vegetation greening and land use on runoff (63). 
Closer collaboration between the communities 
would help to identify capabilities relevant to 
VIA, define impact-relevant metrics that ESMs 
should produce, and develop data sets and protocols
for validation of simulated impacts (61).

ESMs remain just one of several means to 
study VIA. A suite of specialized research tools 
including statistical models and process-based 
crop, ecosystem, and hydrology models is required 
(6, 7). These models have the advantage that they 
can be run at the fine-scale spatial resolution 
needed to inform decision making. Also, they are 
less computationally expensive than ESMs and 
can therefore be used in an ensemble of simulations
to assess uncertainty.

ESMs cannot yet represent the rich ecological 
detail needed to capture spatial heterogeneity at 
local scales. Similarly, the ocean ecosystem models 
used in ESMs typically incorporate the lowest 
trophic levels of the marine food web (phyto- 
plankton, herbivorous zooplankton) and have only 
a limited representation of biodiversity. Often, 
ESMs lack the ecological complexity required to 
predict outcomes in higher trophic levels and 
fisheries. The spatial resolution of global models 
is too coarse to capture regional dynamics of highly 
productive coastal ecosystems and coral reefs, and 
models are just beginning to incorporate adequate 
land-ocean connectivity to assess nutrient eutroph- 
ication, water quality, and harmful algal blooms 
(64). Variable-resolution global models with a hor- 
izontal resolution that refines from a 1° global grid 
to a regional 0.125° (14-km) grid help to bridge 
the gap between coarse-scale ESMs and the finer 
scales needed for VIA research (65, 66). 
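
As a quick check on the grid sizes quoted above, the following sketch converts a latitude-longitude resolution in degrees to an approximate spacing in kilometers. It assumes a spherical Earth with a mean radius of 6371 km; the function name `grid_spacing_km` is illustrative and does not come from any ESM code base.

```python
import math

EARTH_RADIUS_KM = 6371.0  # assumed mean Earth radius (spherical approximation)


def grid_spacing_km(resolution_deg: float, latitude_deg: float = 0.0) -> tuple[float, float]:
    """Return (north-south, east-west) spacing in km for a regular lat-lon grid."""
    km_per_degree = 2.0 * math.pi * EARTH_RADIUS_KM / 360.0
    north_south = resolution_deg * km_per_degree
    # Meridians converge toward the poles, so east-west spacing shrinks with latitude.
    east_west = north_south * math.cos(math.radians(latitude_deg))
    return north_south, east_west


for resolution in (1.0, 0.125):
    ns, ew = grid_spacing_km(resolution)
    print(f"{resolution:6.3f} deg -> about {ns:.1f} km at the equator")
```

Under these assumptions a 0.125° cell is roughly 14 km on a side at the equator, consistent with the figure given in the text, while a 1° cell is roughly eight times larger.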


Climate change mitigation 


Reducing the sources and enhancing the sinks of 
long-lived greenhouse gases are the most direct 
means to mitigate anthropogenic climate change 
(67). However, many interventions that might 
reduce greenhouse gas emissions affect the bio- 
sphere and have other effects on climate and 
ecosystem services. Afforestation, reforestation, 
or avoided deforestation, for example, enhance the 
terrestrial carbon sink, but also warm climate an- 
nually by decreasing surface albedo, cool climate 
through evapotranspiration and turbulent mixing 
with the atmosphere, and have additional effects 
through atmospheric chemistry and aerosols (13, 68). 
These biogeophysical effects can counter the car- 
bon mitigation benefits of forests so that even 
more extensive forested land may be required to 
achieve climate stabilization at a target that avoids 
dangerous climate change (e.g., 2°C). ESMs are an 
imperfect but necessary tool to study the net cli- 
mate effects of forests (68). 

Agriculture is another example of the need to 
consider mitigation in an ESM context. Efficient 
application of nitrogen fertilizer, tillage, and other 
management can enhance carbon storage and reduce
N2O emissions (69). Crops also affect climate
through biogeophysical coupling with the atmo- 
sphere; it is likely that expansion of agricultural 
lands over the industrial era has cooled climate 
because of these changes (70). Intensification of 
agriculture is thought to have cooled summer 
temperatures in the Midwest United States (71).
No-till agriculture can increase surface albedo and 
cool climate (72), and other increases in surface 
albedo may have geoengineering potential (73). 
Production of bioenergy for carbon capture and 
storage (BECCS) can also mitigate climate change, 
but land use for BECCS must be balanced by arable
land for food production (74). ESMs provide a
necessary tool to investigate the multidisciplinary 
outcomes of BECCS for climate, food, energy, and 
fresh water. ESMs are also being used to deter- 
mine the effects on ecosystems of geoengineering 
techniques involving solar radiation modification 
such as stratospheric aerosol injection, cloud brightening,
and surface albedo manipulation (75, 76).

At present, the ocean removes roughly a quarter 
of anthropogenic CO2 emissions to the atmosphere,
with the magnitude modulated by chemical
dissolution into surface seawater and the
physical rate of exchange between surface and 
deep waters (16). Over the centuries-long time 
scales of ocean-overturning circulation, an increasing
fraction of anthropogenic CO2 will be stored
in the deep ocean reservoir. Several geoengineering
approaches have been proposed to increase
ocean carbon uptake by injecting CO2 directly at
depth, fertilizing phytoplankton to speed up the 
marine biological pump that transports organic 
carbon from the surface layer into the deep ocean, 
and accelerating weathering processes to add al- 
kalinity to seawater (67). A number of studies 
with ocean-only models and full ESMs have ex- 
plored the possible efficacy of these approaches, 
as well as the potential for ecological impacts 
and biogeochemical feedbacks (75). Substantial 
alterations to marine ecosystems could also arise 
from solar radiation modification. 
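
The round fractions quoted in this Review (land and ocean together absorbing about one-half of annual anthropogenic CO2 emissions, with the ocean taking up roughly a quarter) imply a simple back-of-envelope budget. The sketch below only does that arithmetic; the function name, the default fractions, and the emission total of 100 arbitrary units are illustrative assumptions, not reported data.

```python
def partition_emissions(emissions, total_sink_fraction=0.5, ocean_fraction=0.25):
    """Split an emission total into ocean, land, and atmospheric shares.

    The default fractions are the rounded values quoted in the text; the land
    share is obtained by difference and is therefore only indicative.
    """
    ocean = emissions * ocean_fraction
    land = emissions * (total_sink_fraction - ocean_fraction)
    atmosphere = emissions * (1.0 - total_sink_fraction)
    return {"ocean": ocean, "land": land, "atmosphere": atmosphere}


# Hypothetical total of 100 arbitrary units, chosen so shares read as percentages.
print(partition_emissions(100.0))
```

With these rounded inputs the land sink comes out at about a quarter of emissions and about half remains in the atmosphere; the uncertain future efficacy of the sinks noted earlier means that the fractions should not be treated as constants.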


Earth system prediction 


Atmospheric science has long embraced models 
to make predictions of near-term weather and 
long-term climate. ESMs enable predictions of 
the biosphere, but the atmospheric-centric view 
of prediction needs to be extended to the bio- 
logical realm. In this section, we introduce termi- 
nology and concepts specific to weather forecasts 
and climate prediction and then show extension 
to the biosphere. 

Forecasting the weather on time scales of a few 
hours to 2 weeks is a classic prediction problem 
in which a model that describes the atmosphere- 
land-ocean-sea ice system is stepped forward in 
time from initial conditions. The predictability— 
the capability to make a skillful forecast—is 
limited by uncertainty in the exact initial condi- 
tions, imperfections in the model and under- 
standing of the underlying physics and dynamics, 
and the degree of randomness or chaotic behavior 
in the system. The same concept applies to pre- 
dicting climate at seasonal, interannual, or dec- 
adal time scales (77-79). Climate projection over 
several decades must consider additional long-term 
Earth system processes such as ocean circulation, 
ice sheet melting, and changes in vegetation, ter- 
restrial and marine biogeochemistry, and human 
behavior. The lattermost is particularly poorly 
known and is imposed using anthropogenic 
forcing scenarios. At multidecadal time scales, 
the exact initial state is less critical. Instead, 
uncertainty in climate projections is largely dom- 
inated by the choice of an anthropogenic forcing 
scenario, although uncertainties also remain with 
regard to climate sensitivity and feedback pro- 
cesses (Fig. 2). 
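
The limit that uncertain initial conditions place on predictability can be illustrated with a toy chaotic system. The sketch below uses the three-variable Lorenz (1963) equations with their conventional parameter values, not an ESM or any weather model; the step size, the perturbation size, and the function names are assumptions chosen for illustration.

```python
def lorenz_step(state, dt=0.01, sigma=10.0, rho=28.0, beta=8.0 / 3.0):
    """Advance the Lorenz-63 system one step with fourth-order Runge-Kutta."""

    def tendency(s):
        x, y, z = s
        return (sigma * (y - x), x * (rho - z) - y, x * y - beta * z)

    def shifted(s, k, h):
        return tuple(si + h * ki for si, ki in zip(s, k))

    k1 = tendency(state)
    k2 = tendency(shifted(state, k1, dt / 2))
    k3 = tendency(shifted(state, k2, dt / 2))
    k4 = tendency(shifted(state, k3, dt))
    return tuple(
        s + dt / 6.0 * (a + 2 * b + 2 * c + d)
        for s, a, b, c, d in zip(state, k1, k2, k3, k4)
    )


def separation_over_time(perturbation=1e-8, n_steps=3000):
    """Distance between a control run and a run with a tiny initial-state error."""
    control = (1.0, 1.0, 1.0)
    perturbed = (1.0 + perturbation, 1.0, 1.0)
    distances = []
    for _ in range(n_steps):
        control = lorenz_step(control)
        perturbed = lorenz_step(perturbed)
        distances.append(sum((a - b) ** 2 for a, b in zip(control, perturbed)) ** 0.5)
    return distances


distances = separation_over_time()
print(f"separation after 1 time unit:   {distances[99]:.2e}")
print(f"largest separation in the run:  {max(distances):.2e}")
```

The two runs start almost identically, stay close for a while, and then diverge until the error is as large as the system's own variability. This is the sense in which a forecast loses skill beyond some lead time even with a perfect model.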

The term Earth system prediction is used to 
capture this spectrum of temporal scales from 
subseasonal to multidecadal, mostly in the context
of weather and climate (77-81). In a broader
perspective, however, the scope of Earth system 
prediction can be expanded to include other facets 
of the Earth system. The predictability of Arctic
sea ice loss is a prominent such example (82). As 
climate models have evolved into more complex 
ESMs, the predictability of biosphere states and 
processes needs to be considered jointly with 
that of weather and climate. 

The predictive capability of a model depends 
on the sources of errors in the forecast. For cli- 
mate, these errors are broadly classified into ini- 
tial conditions, boundary conditions, and model 
uncertainty, including both model structure and 
parameters (83). Uncertainty in exact initial con- 
ditions manifests in unforced variability internal 
to the climate system (also known as natural variability),
in which small differences in initial conditions
produce different climate trajectories. The
importance of natural variability can be assessed 
through a multimember ensemble of simulations 
with a single model initialized with different states. 

The second source of uncertainty is model error, 
seen in the model response to the imposed forc- 
ing scenarios. Models are imperfect and differ in 
their forced response owing to their spatial reso- 
lution and imprecision in their parameterization 
of the various physical, chemical, and biological 
processes. Model uncertainty is assessed through 
multimodel ensemble studies. 

The third source of uncertainty is the forcing 
scenarios and their depiction of the time evolution 
of greenhouse gases, land use, and other anthro- 
pogenic forcings of climate, which are imposed 
as boundary conditions to the models. For tem- 
perature projections at the global scale, model 
uncertainty and natural variability dominate at 
near-term decadal time scales (10 to 30 years) (83). 
Scenarios are the major source of uncertainty at 
multidecadal lead times. Total uncertainty is larger
at regional scales, mostly from natural variability 
and model structure. 
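
The three sources of uncertainty described above can be illustrated with a simple variance partition. The sketch below is a simplified illustration, not the method of any cited study: the function name `partition_uncertainty`, the nested-dictionary layout, and all numbers are hypothetical. It treats the spread among ensemble members as internal variability, the spread among model means as model uncertainty, and the spread among scenario means as scenario uncertainty.

```python
from statistics import mean, pvariance


def partition_uncertainty(ensemble):
    """Split total variance into internal, model, and scenario fractions.

    ensemble[scenario][model] is a list of ensemble-member values
    for one quantity at one lead time (hypothetical layout).
    """
    # Internal variability: spread among members of the same model and scenario.
    internal = mean(
        pvariance(members)
        for models in ensemble.values()
        for members in models.values()
    )
    # Model uncertainty: spread among model means within each scenario.
    model = mean(
        pvariance([mean(members) for members in models.values()])
        for models in ensemble.values()
    )
    # Scenario uncertainty: spread among the multimodel means of the scenarios.
    scenario = pvariance(
        [mean(mean(members) for members in models.values())
         for models in ensemble.values()]
    )
    total = internal + model + scenario
    return {
        "internal": internal / total,
        "model": model / total,
        "scenario": scenario / total,
    }


# Invented values: two scenarios, two models, three members each.
ensemble = {
    "low": {"model_a": [1.0, 1.2, 0.8], "model_b": [1.5, 1.7, 1.3]},
    "high": {"model_a": [2.0, 2.2, 1.8], "model_b": [2.5, 2.7, 2.3]},
}
fractions = partition_uncertainty(ensemble)
print(fractions)
```

With these made-up values the scenario term dominates, which mirrors the behavior the text describes for multidecadal lead times; a real analysis would repeat the partition at each lead time and spatial scale.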

A related concept is time of emergence. Deter- 
mining the time at which the forced climate 
change signal emerges from the noise of natural 
variability is a necessary requirement in assess- 
ing when an expected change can be detected or 
whether observed changes can be attributed to 
anthropogenic forcings (84, 85). Time of emer- 
gence has been mostly studied for temperature 
and can vary from a few decades in mid-latitudes 
with low natural variability to several decades or 
longer in regions with larger natural variability. 
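
Time of emergence can be made concrete with a small calculation. The following is a minimal sketch under stated assumptions: a known forced signal, a constant noise level (the standard deviation of natural variability), and a hypothetical helper named `time_of_emergence`. The linear trend and noise values are invented for illustration only.

```python
def time_of_emergence(years, signal, noise, threshold=1.0):
    """Return the first year from which |signal| / noise stays above threshold.

    Returns None if the signal never emerges permanently within the record.
    """
    emerged = None
    for year, value in zip(years, signal):
        if abs(value) / noise > threshold:
            if emerged is None:
                emerged = year
        else:
            # The signal fell back into the noise, so discard the earlier crossing.
            emerged = None
    return emerged


years = list(range(2006, 2101))
# Hypothetical forced trend: 0.02 units per year relative to 2006.
trend = [0.02 * (year - 2006) for year in years]

toe_quiet = time_of_emergence(years, trend, noise=0.21)  # low natural variability
toe_noisy = time_of_emergence(years, trend, noise=0.81)  # high natural variability
print(toe_quiet, toe_noisy)
```

In this toy case the larger noise delays emergence by three decades, the same qualitative behavior the text attributes to regions with larger natural variability.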

Can these concepts of predictability, uncer- 
tainty, and time of emergence be extended to 
study the biosphere in the Earth system? ESMs 
predict prominent changes in terrestrial and 
marine biogeochemistry, but only recently have 
the necessary large, multimodel and multimem- 
ber ensembles become available to distinguish 
the forced response from natural variability and 
model uncertainty. Such analyses give important 
insights into the use of ESMs to understand changes 
in Earth system biogeochemistry. In addition to 
warmer temperatures, the ocean has trends of 
increased carbon uptake, acidification, lower O₂,
and reduced net primary production over the next
several decades in the absence of mitigation (86).
The forced trend in air-sea CO₂ flux emerges
rapidly in some ocean regions, but large natural
variability precludes detection of changes in the
rate of carbon uptake until mid-century or later
in many regions (87). Other aspects of ocean biogeochemistry
such as pH, O₂ concentration, and
net primary production also have large, region- 
ally dependent natural variability (88-90). Ocean 
acidification has a rapid time of emergence driven
by the accumulation of anthropogenic CO₂ in the
surface layer, and the sea surface temperature
warming signal also emerges within a few decades
in many regions, but forced changes in O₂ concentration
and net primary production do not
emerge from natural variability until mid- to late-century,
if at all (Fig. 3, A to C).

Fig. 2. Schematic depiction of Earth system prediction of the biosphere. The synergies between
climate feedback processes, internal climate variability, and ecosystem impacts determine model
outcomes. Subseasonal to seasonal forecasts and decadal climate prediction are initial value problems.
Earth system projections are a boundary value problem driven by anthropogenic forcing scenarios.
Uncertainty arises from inexactness of initial conditions, model imperfections, and scenarios.
Time scales shown in the figure: subseasonal to seasonal forecast (2 weeks to 12 months),
decadal prediction (1 to 30 years), and Earth system projection (30 to 100+ years).

Time of emergence has been less studied in 
the terrestrial biosphere. Observational and model- 
ing analyses support an enhanced terrestrial car- 
bon sink arising from global change (17, 56, 91). 
Unforced variability in the land-atmosphere CO₂
flux is large and precludes detection of change at 
decadal time scales (92). There is considerable 
variability within and among models, and the 
forced response statistically emerges only after 
several decades in many regions of the world. 
The HadGEM2-ES model, for example, shows the 
forced signal of terrestrial carbon gain as emerg- 
ing from internal variability in many regions by 
2030, but other models show a weaker signal 
that has yet to statistically emerge, and even car- 
bon loss rather than gain (Fig. 3, D to F). 

The various contributions to uncertainty differ 
depending on the quantity of interest, lead time, 
and spatial scale. The uncertainty from natural 
variability is particularly large at small spatial 
scales and short lead times for pH, O₂ concentration,
and net primary production in the ocean
(89) and air-sea carbon flux (93). By the end of 
the 21st century, scenario uncertainty dominates 
total uncertainty for these states and fluxes at the 
global scale (except for net primary production), 
but natural variability and model uncertainty re- 
main large at the regional or biome scale. Simu- 
lations of the terrestrial carbon cycle are much 
more variable among models and largely domi- 
nated by model uncertainty (94). Comparisons of 
ocean and land carbon cycle projections point to 
a markedly different assessment of uncertainty 
(Fig. 4). For ocean carbon uptake, model uncer- 
tainty is initially large, but scenario uncertainty 
dominates during the latter part of the 21st century. 
In contrast, model structure contributes 80% of 
total uncertainty throughout the 21st century for 
the terrestrial carbon cycle. 

Much of the study of Earth system prediction 
is focused on climate, and decadal climate predic- 
tion is particularly focused on the dynamics and 
thermal characteristics of the ocean because of 
its prominent role in climate variability. When 
soil moisture and vegetation are considered by 
atmospheric modelers, it is often in the context 
of how these affect climate predictability, rather 
than whether they can be predicted (79). In a 
global change perspective, ecological predictions 
are as essential as those of the physical climate 
system. ESMs provide the means not just to as- 
sess the potential for future stresses engendered 
by a changing climate, but also to determine the 
outcome of those stresses on crop yield, tree 
mortality, fisheries, and other aspects of the 
biosphere. For example, characterizing when 
and where wildfires might occur would be valu- 
able to aid governmental agencies charged with 
wildfire protection. Particular fire events are nearly 
impossible to forecast, especially because so many
fires are caused by people, but fire behavior on
seasonal time scales can be forecast on the basis
of relationships with sea surface temperatures
(95). Prediction of future fire behavior requires
models that accurately depict fire occurrence
and severity, but wildfire prediction will also
require an understanding of the predictability
of the climate that drives fire behavior. A similar
argument pertains to crop yield, marine resources,
and dust emissions simulated in the current generation
of ESMs and to forest mortality and habitat
loss that will be simulated by the next generation
of models. Such forecasts may be particularly relevant
at the subseasonal to seasonal time scale (79).

Fig. 3. Ocean and land forced trends relative to internal variability and model uncertainty.
Data are from a range of ESMs contributed to CMIP5. (A to C) Multimodel time of
emergence for SST, O₂, and net primary production (NPP) (89). Time of emergence is defined as
the year at which the signal exceeds the noise, which, as used here, includes both internal
variability and model uncertainty. The forced SST signal emerges rapidly in many locations,
O₂ time of emergence is regionally variable, and the forced NPP signal does not statistically
emerge by 2100. (D to F) Signal-to-noise ratio for cumulative land carbon uptake in
a business-as-usual scenario at 2030 for three different ESMs (92). Positive (negative) values
indicate carbon gain (loss). In these panels, the noise is strictly internal variability, and a ratio
greater than 2 or less than -2 indicates that the signal has emerged from the internal
variability. There are considerable differences among models in the sign of the terrestrial
carbon flux and whether the change has emerged from natural variability by 2030. CCSM4,
Community Climate System Model version 4; HadGEM2-ES, Hadley Centre Global Environmental
Model version 2; CanESM2, second-generation Canadian Earth System Model.


Research needs


As this Review highlights, the biosphere is central
to understanding why and how the Earth system
is changing and to adapting to and mitigating
future changes. Many of the global change stressors
that terrestrial and marine ecosystems face
need to be understood not only for their impacts
on ecosystem services that are essential to humankind,
but also as processes that affect the magnitude
and trajectory of climate change. A strategy
is needed to extend the study of subseasonal and
seasonal forecasts and decadal climate prediction
to a more multifaceted Earth system prediction,
including the biosphere and its resources. The extension
of seasonal to decadal climate forecasts to
living marine resources, for example, has considerable
potential to aid marine management (96).
A similar extension to terrestrial ecosystems would
aid land resource management.

Toward this goal, this Review has presented
several pathways to further define Earth system
prediction. First is continued advancement of
terrestrial and marine science in light of climate
processes and the many ways in which the
biosphere influences climate. A prominent example
is the carbon cycle, its feedback with climate
change, and whether terrestrial and marine ecosystems
can be purposely managed to mitigate
anthropogenic CO₂ emissions. There is considerable
uncertainty in ocean carbon cycle projections,
particularly at regional or biome scales
(89, 93), and land carbon model uncertainty precludes
distinguishing among various alternative
scenarios (94). Moreover, if planting forests and
biofuels are essential to maintaining atmospheric
CO₂ concentrations within some planetary warming
target, how confident are we in our ability
to know the net climate outcome of these policies (68)?

The current generation of ecosystem models 
are abstractions of complex systems. Many eco- 
logical and biogeochemical processes are rep- 
resented, but the challenge of representing the 
biosphere—with its rich diversity of life forms, 
their assemblage into communities and ecosys- 
tems, and the complexity of ecological systems—is 
daunting, as is evident in large model uncertainty 
in terrestrial carbon cycle projections. Theoretical
advances are needed, but there may be a limit to
how much model uncertainty can be reduced (94).
More complexity does not necessarily lead to better
predictions or reduce uncertainty.

Fig. 4. Ocean and land carbon cycle uncertainty. The percentage of total variance
attributed to internal variability, model uncertainty, and scenario uncertainty in
projections of cumulative global carbon uptake from 2006 to 2100 differs widely between
(A) ocean and (B) land. The ocean carbon cycle is dominated by scenario uncertainty by the
middle of the century, but uncertainty in the land carbon cycle is mostly from
model structure. Data are from 12 ESMs using four different scenarios (94).

A second pathway is to better integrate ESMs 
and VIA models. The gap between models arises 
from disciplinary expertise (atmospheric and ocean 
sciences for ESMs and hydrology, ecology, biogeo- 
chemistry, agronomy, forestry, and marine sciences 
for VIA models), but effective communication 
among, rather than across, disciplines is not trivial. 
There are also pragmatic considerations, partic- 
ularly with regard to spatial scale and process 
complexity, that limit collaboration between global 
ESMs and VIA models with a more local to re- 
gional domain. However, just as the science of 
Earth system prediction is seen as a means to unite 
the weather and climate modeling communities 
(80, 81), so, too, can the broadening of Earth system 
prediction to include the biosphere stimulate col- 
laborations with the VIA community. 

A third promising research pathway is to expand 
the concepts and methodology of seasonal to dec- 
adal climate prediction to include terrestrial and 
marine ecosystems and to quantify prediction un- 
certainty at spatial and temporal scales relevant 
to stakeholders. The predictability of the terres- 
trial carbon cycle can be considered from an eco- 
logical perspective (97), but only recently has it 
been considered in an Earth system perspective 
of natural climate variability, the forced climate 
response, and model uncertainty (92, 94). Anal- 
ysis of natural variability, model uncertainty, and 
scenario uncertainty is similarly informing marine 
biogeochemistry (87-90, 93). Whether the bio- 
sphere is a source of climate predictability is not 
necessarily the right question to pose. A more 
fruitful research pathway may be to investigate 
how to predict the biosphere and its resources 
in a changing environment, as identified specifically
for marine living resources (96) and considered
also for atmospheric CO₂ (98). Initial
condition uncertainty and the difficulty in separating
natural variability from the forced trend likely
produce irreducible uncertainty in climate prediction
(99). At the regional or biome scale, natural
variability is large for the ocean and land
carbon cycles (89, 92, 93). Whether a similar irreducible
uncertainty manifests in terrestrial and
marine ecosystems remains to be explored. 

With their terrestrial and marine ecosystems, 
biogeochemical cycles, and simulation of plants, 
microbes, and marine life, ESMs challenge ter- 
restrial and marine ecologists and biogeochem- 
ists to think in terms of broad generalizations 
and to find the mathematical equations to de- 
scribe the biosphere, its functioning, and its re- 
sponse to global change. ESMs similarly challenge 
geoscientists to think beyond a physical under- 
standing of climate to include biology. The models 
show much promise to advance our understand- 
ing of global change but must move from the 
synthetic world of an ESM toward the real world. 
Bridging the gap between observations and theory 
as atmospheric CO₂ rises, climate changes, more
nitrogen is added to the system, forests are cleared, 
grasslands are plowed or converted to pastures, 
coastal wetlands and coral reefs are degraded or 
lost, and oceans warm and are increasingly polluted 
poses challenging opportunities for the next gener- 
ation of scientists to advance planetary ecology and 
climate science.


NEAR-FIELD COSMOLOGY


A whirling plane of satellite galaxies 
around Centaurus A challenges 
cold dark matter cosmology 


Oliver Müller,* Marcel S. Pawlowski, Helmut Jerjen, Federico Lelli


The Milky Way and Andromeda galaxies are each surrounded by a thin plane of satellite 
dwarf galaxies that may be corotating. Cosmological simulations predict that most 
satellite galaxy systems are close to isotropic with random motions, so those two well- 
studied systems are often interpreted as rare statistical outliers. We test this assumption 
using the kinematics of satellite galaxies around the Centaurus A galaxy. Our statistical 
analysis reveals evidence for corotation in a narrow plane: Of the 16 Centaurus A satellites 
with kinematic data, 14 follow a coherent velocity pattern aligned with the long axis of their 
spatial distribution. In standard cosmological simulations, <0.5% of Centaurus A-like 
systems show such behavior. Corotating satellite systems may be common in the universe, 
challenging small-scale structure formation in the prevailing cosmological paradigm. 


The presence of planes of satellite dwarf galaxies
around the Milky Way (1-4) and Andromeda (5-7)
galaxies has challenged our understanding of
structure formation on galactic and subgalactic
scales. Similar structures are rare in galaxy
formation simulations based
on the standard Lambda cold dark matter (ΛCDM)
cosmological model, which predicts close to isotropic
distributions and random kinematics for
satellite systems (8). The existence of planes of
satellite galaxies around these two largest galaxies
in the Local Group is difficult to explain within
the ΛCDM framework. Some authors have argued
that preferential accretion of satellites along filaments
may explain such flattened structures (9).
Others suggest that the Local Group should be
considered a rare exception in an otherwise successful
cosmological model (10-12). This interpretation,
however, has been challenged by emerging
evidence for anisotropic satellite distributions
around massive galaxies beyond the Local Group
(13, 14).

The cosmic expansion of the Local Void (a vast, 
empty region of space adjacent to the Local Group) 
has been suggested as a possible origin for the 
formation of these planar structures (15). An issue
that is mostly ignored in this context is the coherent
kinematics of the satellite galaxies, which
are likely corotating around their host. This is
clear for the Milky Way (16, 17), where accurate
proper motions are available for several satellites,
but it remains more uncertain for Andromeda
(7) because only velocities projected along the
line of sight (LoS) are measurable. Such orderly
kinematic motions are extremely rare in high-resolution
cosmological N-body simulations (18)
and statistically should not be observed in typ- 
ical galaxy groups. It remains unclear whether 
such planes of satellites are unique to the Local 
Group or ubiquitous in the nearby universe. 

In this Research Article, we study the galaxy
group in the constellation Centaurus. The
Centaurus Group is the richest assembly of galaxies
within a distance of 10 megaparsecs (Mpc) from
the Milky Way, the so-called Local Volume (19, 20).
It comprises two concentrations: the Cen A subgroup
dominated by a radio-active elliptical galaxy
Centaurus A (Cen A, NGC 5128) at a distance
of 3.8 Mpc, and the M 83 subgroup dominated
by a late-type spiral galaxy M 83 (NGC 5236) at a
distance of 4.9 Mpc (19, 20). The galaxies that
are gravitationally bound to Cen A were claimed
to be distributed in two parallel planes (21). The
discovery of additional satellite galaxies in the
group weakened the case for a double-planar
structure, and a single-plane interpretation has
become more statistically significant (22-24). This
plane has a small scale height with a root-mean-square
(RMS) thickness of 69 kiloparsecs (kpc)
and a major-axis RMS length of 309 kpc (25). We
investigate the kinematics of this planar structure
and compare it with galaxy formation simulations
in ΛCDM cosmology.


Dynamics of the Cen A satellite system 


From Earth, the satellite plane around Cen A is 
seen nearly edge-on at an inclination of 14.6° (25). 
This coincidental geometrical alignment allows 
us to scrutinize the kinematics of the plane. We 
use all available heliocentric velocities for the 
Cen A satellites, taken from the Local Volume catalog
(19, 20). The vast majority of satellites have
accurate distances derived from the tip magnitude
of the red giant branch (TRGB) method with a 
typical uncertainty of ~5%. There are 31 confirmed 
satellites of Cen A with accurate distance mea- 
surements. Half of them have measured LoS ve- 
locities. One sample galaxy (KKs 59) has a measured 
velocity but lacks a TRGB distance: We adopt 
the same distance as for Cen A; excluding this 
galaxy does not change our results. The adopted 
data are listed in table S1. 

The on-sky distribution of the satellites is plotted 
in Fig. 1 together with their motions relative to Cen 
A. Figure 1 also shows the positions and kinematic 
information for 1239 planetary nebulae (PNe) (26) 
and the three-dimensional (3D) distribution of the 
satellites with measured velocities. The mean velocity
of the Cen A satellite system (555 km s⁻¹)
is equal to the recession velocity of Cen A (556 ±
10 km s⁻¹) within the measurement uncertainties.
Hereafter, the recession velocity of Cen A is used 
as a zero-point reference, and the terms “approach- 
ing” and “receding” are intended with respect 
to this velocity. The dust lane of Cen A serves as 
a natural dividing line: Its position angle (PA = 
110°) roughly coincides with the geometrical mi- 
nor axis of the satellite plane (25). Clearly, ap- 
proaching and receding satellites tend to lie to 
the southwest and northeast of the dividing line, 
respectively, indicating a kinematically coherent 
structure. 

To determine the statistical significance of the 
kinematic coherence, we compare the velocities 
of Cen A satellites to a random phase-space distri- 
bution. Every galaxy has a 50% chance of approach- 
ing or receding along the LoS. The probability of 
finding at least 14 out of 16 galaxies with coherent 
velocity movement is 0.42%. Consequently, the 
observed velocity pattern of the Cen A satellites 
is statistically different from a random phase-
space distribution at the 2.6σ confidence level.
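
The quoted probability can be checked with a short binomial calculation. This is a sketch under one stated assumption: that either side of the dividing line may be the approaching one, so the one-sided tail is doubled. That assumption reproduces the 0.42% figure given in the text; the function name `coherence_probability` is hypothetical.

```python
from math import comb


def coherence_probability(n_total, n_coherent):
    """Chance that at least n_coherent of n_total satellites move coherently.

    Each satellite is assumed to approach or recede with probability 1/2.
    The tail is doubled because either sense of rotation counts as coherent
    (an assumption; it is only valid for n_coherent > n_total / 2).
    """
    one_sided = sum(
        comb(n_total, k) for k in range(n_coherent, n_total + 1)
    ) / 2 ** n_total
    return 2 * one_sided


p = coherence_probability(16, 14)
print(f"{100 * p:.2f}%")
```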

Figure 2 shows the heliocentric velocities of 
the satellite galaxies as a function of their dis- 
tances to Cen A. The geometrical minor axis of 
the plane (or equivalently the dust lane) is used to 
assign a positive or negative sign to the distance 
between satellite galaxies and Cen A. Figure 2 
shows a clear trend: Galaxies to the south of 
Cen A are approaching, whereas galaxies to the 
north are receding. This is to be expected if the 
satellites are rotating around Cen A. Only two 
satellite galaxies (KK 221 and ESO 269-058) deviate 
from this trend and may potentially be counter- 
rotating, analogous to the Sculptor dwarf in the 
Milky Way halo (27). An inspection of their prop- 
erties and alignment inside the plane does not 
reveal any peculiar characteristics (e.g., they are 
not more massive or luminous than other sat- 
ellites). The velocity field of the planetary nebulae 
within Cen A follows a similar trend: Planetary 
nebulae in the northern and southern hemisphere 
are (on average) systematically blue and red shifted, 
respectively. 

To explore the observed velocity pattern for 
the satellite galaxies, we ran three statistical tests: 
namely, Pearson’s R, Spearman’s Rho, and Kendall’s 
Tau. These are standard methods to test correlations
between independent variables. Whereas
Pearson’s method tests for a strictly linear
correlation, the Spearman’s Rho and Kendall’s
Tau methods test for a general correlation between
the variables. The null hypothesis is that
velocities and separations are uncorrelated. The
velocity pattern is significant within a confidence
interval of 2σ (P value < 0.03) for the projected
separation and 3σ (P value < 0.01) for the 3D
separation (28). These low P values lead us to
reject the null hypothesis, implying a small chance
of finding such a correlation in random, normally
distributed data. We further consider how much
more likely the hypothesis of correlated data is
with respect to the hypothesis of uncorrelated
data. We applied a Bayesian correlation test (29)
and found that the scenario of coherently moving
satellites is 4.5 times more likely using the
projected separation and 16.5 times more likely
using the full 3D information than uncorrelated
satellite movements (28). Projected separations
consistently give lower statistical significance than
3D distances because they contain less physical
information: This highlights the importance of
having TRGB distance measurements for dwarf
galaxies in Centaurus.

Fig. 1. On-sky and 3D distribution of the satellite system. (A) The on-sky
distribution of the Cen A subgroup. The central image of Cen A has been
scaled up by a factor of 5 to illustrate the features of the host galaxy. Blue
downwards- and red upwards-pointing triangles show approaching and
receding satellite galaxies with respect to Cen A velocity, respectively. Open
circles are group member candidates; filled circles are confirmed satellites
without velocity measurements. The line that optimally separates
the approaching and receding satellites is indicated with the wide gray band;
it coincides with the dust lane of Cen A. (B) The kinematic distribution
of 1239 planetary nebulae (26). Blue PNe are approaching, and red PNe are
receding relative to Cen A. (C) A 3D representation of the spatial distribution
of the Cen A satellite galaxies in equatorial Cartesian coordinates (with
the origin at Earth). The length of the colored lines is proportional to the
observed velocity; the dashed line is our line of sight toward Cen A.
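
As an illustration of the difference between a linear and a rank-based correlation measure applied to separations and velocities, the sketch below computes Pearson's R and Spearman's Rho in plain Python. It is not the authors' analysis: the data are synthetic, the helper names are hypothetical, and Kendall's Tau and the P values are omitted for brevity.

```python
from statistics import mean


def pearson_r(x, y):
    """Linear correlation coefficient between two equal-length sequences."""
    mx, my = mean(x), mean(y)
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sx = sum((a - mx) ** 2 for a in x) ** 0.5
    sy = sum((b - my) ** 2 for b in y) ** 0.5
    return cov / (sx * sy)


def ranks(values):
    """Ranks starting at 1; tied values receive the average of their ranks."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    result = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        average_rank = (i + j) / 2 + 1
        for k in range(i, j + 1):
            result[order[k]] = average_rank
        i = j + 1
    return result


def spearman_rho(x, y):
    """Rank correlation: Pearson's R computed on the ranks of x and y."""
    return pearson_r(ranks(x), ranks(y))


# Synthetic (made-up) signed separations in kpc and
# velocities relative to the host in km/s.
separation = [-300, -220, -150, -60, 40, 120, 210, 330]
velocity = [-90, -40, -70, -10, 25, 5, 60, 110]

r = pearson_r(separation, velocity)
rho = spearman_rho(separation, velocity)
print(round(r, 3), round(rho, 3))
```

Because Spearman's Rho depends only on rank order, it is insensitive to whether the trend is strictly linear, which is the distinction the text draws between the tests.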


Implications for galaxy formation 


The satellite galaxies in the Cen A subgroup col- 
lectively form a coherent kinematical structure. 
Comparable structures have been discovered in 
the Milky Way halo, where the majority of the 
11 classical satellites share a coherent orbital mo- 
tion (established with proper motion measurements 
of individual stars from the satellites) (16), and 
for the Andromeda galaxy, for which 13 out of 15
satellites follow a coherent LoS velocity trend (7).

Although we find that the kinematics of the
Cen A satellites are unlikely to occur by chance,
this does not immediately allow us to draw conclusions
about their agreement with predictions
from ΛCDM cosmology. Satellite galaxy systems
in cosmological simulations generally exhibit
some degree of phase-space coherence, owing
to the accretion of subhalos from preferred directions,
along filaments and in groups (9). To
judge whether this effect is sufficient to explain
the observed coherence in the Cen A satellites,
we determined the occurrence of such extreme
structures in two cosmological simulations:
Millennium II (30) and Illustris (31). Millennium II
is a dark-matter-only N-body simulation that includes
gravitational effects such as subhalo accretion
from filaments, but neglects baryonic effects
such as stellar and black-hole feedback and possible
destruction of satellite galaxies due to the
enhanced tidal effects from the baryonic disk (32).
The relative importance of these effects is highly
debated (33-36). Hence, we also analyze the hydrodynamical
Illustris simulation (31), which additionally
includes gas physics, star formation, and
feedback processes.

Our approach is analogous to recent studies of 
the frequency of the satellite planes around the 
Milky Way and the Andromeda galaxy (18, 37). 
We identify Cen A analogs within the simulations 
by selecting dark matter halos with masses be- 
tween 4 x 10” and 12 x 10” solar masses (Mo) 
and by rejecting any candidate hosts that have a 
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Fig. 2. Velocities and separations to Cen A. Heliocentric velocities versus angular (A) and 3D 

(B) distances A from Cen A (black sphere), in the north (positive A) or south (negative A) of the dust 
lane. Large and small spheres show, respectively, satellite galaxies and planetary nebulae. Blue 
and red colors indicate, respectively, approaching and receding objects with respect to the Cen A 
velocity. The angular distances of the PNe are scaled up by a factor of 10. 
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Fig. 3. Comparison to 
ACDM cosmological @ 
simulations. The a 
number of kinemati- 2 
cally correlated satel- 


frat = 19.5% 


N(b/a) 


Millennium-ll 
eae 


lites Ncorr and the 
on-sky axis-ratio 
flattening a are plotted 15) 
for Cen A analogs from 
the Millennium II (A) 
and Illustris (B) simu- 13} 
ations. The density of 
simulated systems is 
indicated by the color 11} 
map. The contours 
show the frequency of 
more extreme realiza- 9F 
tions, i.e., the number 3l 
of points to the upper 
eft of each position 
divided by the total 
number of realizations. 


Neorr 


The top and right histograms show the number of realizations with a given 
axis ratio N(b/a) and a given number of correlated velocities N(Neorr), 
respectively. The green areas delineate the regions in which systems are 
as extreme as or more extreme than the observed one: flat. feorn AN fhoth 


companion galaxy with dark matter halo mass 
>1x 10” Mo within 1.4-Mpc distance. We require 
a simulated galaxy-satellite system to fulfill two 
simplified criteria to be considered similar to the 
observed system: (i) The projected on-sky axis 
ratio of the system must be b/a < 0.52, where 
a and b are the semimajor and semiminor axes, 
respectively; and (ii) the kinematic coherence 
along the long axis is at least 14 out of 16 sat- 
ellites. We find that the occurrence of arrange- 
ments similar to those of Cen A in the cosmological 
simulations is 0.1% for Millennium II and 0.5% 
for Illustris (Fig. 3). These estimates must be 
considered upper limits, because we do not take 
into account the full 3D distribution of satel- 
lite galaxies. Even though the hydrodynamical 
Illustris simulation does contain a higher fre- 
quency of systems analogous to Cen A than the 
dark-matter-only Millennium II simulation, they 
are rare cases in both. The observed Cen A sat- 
ellite system is thus in serious tension with the 
expectations from these ACDM simulations, to a 
similar degree as the satellite planes in the Local 
Group. 

Could the coherent motion be the result of 
cosmic expansion? If that were the case, a cor- 
relation between the velocities of the satellites 
and their distances to the Milky Way would be 
expected. This is not found for the sample of 
Cen A satellite galaxies (Fig. 1C). We thus can 
rule out that the cosmic expansion is responsible 
for the observed velocity field. Another possible 
origin of a velocity gradient is a perspective ef- 
fect. For angular offsets 6 along the direction of 
motion, a fraction sin(S) of a system’s bulk tan- 
gential velocity is projected along the LoS (38, 39). 
The velocity gradient found for the Cen A system 
in Fig. 2 implies a tangential velocity on the order 
of 1000 km s™’, comparable to what would be re- 
quired for the Andromeda satellite plane (7). This 
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is unphysically high given that it exceeds the cos- 
mic expansion at the distance of Cen A by a factor 
of 3.7. Such an interpretation of the velocity gra- 
dient would imply that the Cen A group moves in 
the direction defined by the satellite plane, which 
is unlikely. Another potential systematic issue is 
the contribution to the LoS velocity by the motion 
of the Sun around the Galactic center. However, 
we found that this contribution is negligible: —2 
to —4km s“/, depending on the sky position, well 
within the uncertainties of the heliocentric ve- 
locity measurements (table S1). 

The coherent kinematics of the Cen A sat- 
ellites is, instead, best explained by corotation 
within the plane. We explored a toy model with 
purely circular orbits and tried to deproject the 
LoS velocities into circular velocities (28). The 
results were unsatisfactory as many satellites 
would have unrealistic circular velocities, which 
randomly vary with the distance from Cen A. This 
suggests that the satellites must be on elliptical 
orbits, as expected for collisionless objects. Two 
galaxies do not follow the general trend: They 
may be counterrotating, or on highly elliptical 
orbits, or simply unrelated to the planar struc- 
ture. Planetary nebulae provide additional evi- 
dence (40): They also show coherent motion, 
although this is less pronounced than for the 
dwarf satellites: Only 65% of PNe partake in 
the common motion (Fig. 1). Because the same 
trend is present in two independent populations 
of objects with different orbital times, we can 
expect this correlation to be long-lived and thus 
indicative of corotation within the planes. 

Corotation outside the Local Group has been 
investigated with satellite galaxy pairs on opposite
sides of their hosts (41). The LoS velocities of
satellite pairs are preferentially anticorrelated, 
suggesting a high incidence (>50%) of corotating
satellite pairs in the universe (42), although that
result remains controversial (42-44). For the Cen A
subgroup, the presence of a plane of satellite
galaxies is known independently of velocity information
and is established by using multiple
group members. This is unlike previous studies,
which could not determine whether specific pairs
of satellites actually lie in a plane (41).

[Figure caption residue: f_both = 0.5%; f_corr = 1.6%. The percentages give the frequency of realizations that are, respectively, at least as flattened as the observed system, at least as kinematically correlated, or fulfill both conditions simultaneously for the Cen A satellite system.]

In alternative frameworks for the formation 
of dwarf galaxies, corotating planes of satellites 
could be a consequence of past interactions and 
mergers between disk galaxies (16). During gal- 
axy mergers, tidal tails form from disk material 
as a result of angular momentum conservation 
and can collapse into tidal dwarf galaxies (45-47). 
Hydrodynamical simulations show that these may 
survive the interaction and begin orbiting around 
the central merger remnant as dwarf satellites 
(48, 49). In the Local Group, a major merger form- 
ing the Andromeda galaxy has been proposed as 
a possible origin of the observed satellite galaxy 
planes around both the Milky Way (50) and the 
Andromeda galaxy (51). The recent finding of
a correlation between the size of spiral galaxy 
bulges (thought to form via major mergers) and 
the number of satellites is in agreement with this 
picture (52). Even the existence of some counter- 
rotating satellites can be understood in this frame- 
work (27). 

Here, we have provided evidence for a kine- 
matically coherent plane of satellite galaxies 
around Cen A, demonstrating that the phenom- 
enon is not restricted to the Milky Way and An- 
dromeda galaxies. The kinematic coherence can 
be understood if the satellites are corotating 
within the plane, as seen around the Milky Way. 
Considering that the likelihood of finding a single
kinematically coherent plane is <0.5% in cosmological
ΛCDM simulations, finding three such
systems in the nearby universe seems extremely
unlikely.

Splicing by the spliceosome involves branching and exon ligation. The branching reaction leads to the
formation of the catalytic step I spliceosome (C complex). Here we report the cryo-EM structure of the
human C complex at an average resolution of 4.1 Å. Compared to the structure of the S. cerevisiae C
complex, the human complex contains 11 additional proteins. The step I splicing factors CCDC49 and
CCDC94 (Cwc25 and Yju2 in S. cerevisiae, respectively) closely interact with the DEAH-family
ATPase/helicase Prp16 and bridge the gap between Prp16 and the active site RNA elements. These
features, together with structural comparison between the human C and C* complexes, reveal
mechanistic insights into ribonucleoprotein remodeling and allow proposition of a working mechanism for
the C-to-C* transition.


Each cycle of pre-mRNA splicing, executed by the spliceosome,
proceeds in two sequential steps of transesterification:
branching and exon ligation (1, 2). The products of the
branching reaction are an intron lariat-3'-exon intermediate 
and a 5’-exon, both of which remain bound to the catalytic 
step I spliceosome (C complex). The ribonucleoprotein re- 
modeling of the C-to-C* transition is driven by the RNA- 
dependent DEAH-family ATPase/helicase Prp16 (3), which is 
thought to bind and pull the single-stranded RNA sequences 
in a 3’-to-5’ direction (4). 

The first atomic model of an intact spliceosome was gen- 
erated in 2015 from the cryo-electron microscopy (cryo-EM) 
structure of the S. pombe intron lariat spliceosome (ILS) at 
3.6 Å resolution (5, 6). Since 2016, a burst of structural advances
has allowed atomic visualization of the S. cerevisiae
and human spliceosomes (7-20). Here we report the cryo-EM
structure of the human C complex at an average resolution of
4.1 Å. Compared to the yeast C complex (10, 11), the structure
of the human C complex contains 11 additional proteins, including
4 peptidyl prolyl isomerases (PPIs), the exon junction
complex (EJC), U5-40K, RBM22, and Aquarius. The struc- 
tural features and comparison with the human C* complex 
(19, 20) reveal mechanistic insights into the transition from 
the C to the C* complex. 


Cryo-EM analysis 

The human spliceosomes were assembled using HeLa S3 nu- 
clear extract and a synthetic pre-mRNA. Following a pilot ex- 
periment (fig. S1A), we chose to use the small molecule 
BN82685, known to inhibit exon ligation (21), at a final concentration
of 250 µM in the splicing assay. The spliceosomes
were purified and chemically crosslinked. The spliceosome-containing
fractions were pooled on the basis of RNA analysis
(fig. S1B), and the sample was examined by EM under negative
staining (fig. S1C) and cryogenic conditions (fig. S1D). Micrographs
were collected on a Titan Krios electron
microscope, yielding 1,464,033 auto-picked particles.

A subset of 157,388 particles was used to generate the ini- 
tial references for the human spliceosomal complexes (fig. 
S2), which were applied to the entire dataset. Following 
three-dimensional classifications, 95,064 particles gave a reconstruction
of the human C complex at 4.5 Å resolution (fig.
S3). Further selection of 53,633 particles yielded a reconstruction
at an average resolution of 4.1 Å (figs. S3 to S7 and tables
S1 and S2). The local resolutions vary greatly in the human C
complex (fig. S4), reaching 3.8-4.2 Å in the core but considerably
lower in the peripheral regions (table S2). A soft mask
on the Brr2 region improved the local EM map to an average
resolution of 6.5 Å (figs. S3 and S7 and table S2).
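
To put the particle numbers in proportion, the following Python sketch computes what fraction of the auto-picked particles each quoted subset represents. The counts are those given in the text; the stage labels and the helper name are ours. The 157,388-particle subset served only to generate initial references, so it is listed for comparison rather than as a step in the selection funnel.

```python
# Particle counts quoted in the text.
TOTAL_AUTO_PICKED = 1_464_033

SUBSETS = [
    ("initial-reference subset", 157_388),
    ("4.5 A reconstruction", 95_064),
    ("4.1 A reconstruction", 53_633),
]


def retained_fraction(count: int, total: int = TOTAL_AUTO_PICKED) -> float:
    """Fraction of the auto-picked particles represented by a subset."""
    return count / total


for label, count in SUBSETS:
    print(f"{label:26s} {count:>9,d}  {100 * retained_fraction(count):5.2f}% of auto-picked")
```

Under these numbers, fewer than 4% of the auto-picked particles contribute to the final 4.1 Å map.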


Overall structure 

The refined model of the human C complex contains 15,479 
amino acids from 47 proteins and 414 nucleotides from three 
snRNAs (U2, U5, and U6) and the pre-mRNA (Fig. 1A and tables
S1 and S2), with a combined molecular mass of about 1.8
megadaltons. Approximately two-thirds of the 15,479 amino
acids have side chains, and the rest were built as Ala. The poly-Ala
sequences are mostly assigned to proteins in the peripheral
regions, including Prp16 and a few components (Prp19
tetramer, Spf27, Syf1, and part of Cdc5) of the NineTeen complex
(NTC, also known as the Prp19-CDC5L complex). The 47
proteins in the atomic model include 11 from U5 snRNP, 9
from U2 snRNP, 7 from the NTC, 6 from the NTC-related
(NTR) complex, 4 from the EJC, 5 splicing factors (SRm300, Cwc22,
CCDC49, CCDC94, and Prp17), 4 PPIs (tentatively assigned as
PPIL1, CypE, PPIG, and PPWD1), and Prp16 (Fig. 1A).

Compared to the yeast C complex (10, 11), the human C
complex adopts a similarly extended, highly asymmetric or- 
ganization but is considerably larger, with 11 additional pro- 
tein components that are scattered in discrete regions (Fig. 
1B). These 11 proteins include the EJC (eIF4AIII, MAGOH, 
Y14, and MLN51), the U5 snRNP component U5-40K, the NTR 
proteins Aquarius and RBM22, and 4 PPIs. In contrast to the 
yeast C complex structure (10), Prp16 and Brr2 are unambiguously
identified in the human C complex. The human protein
RBM22 bears significant sequence homology to the yeast
NTR proteins Cwc2 and Ecm2 (22), likely reflecting conver- 
gent evolution. Other than Cwc2 and Ecm2, all other proteins 
in the structure of the yeast C complex have corresponding 
functional orthologs in the human C complex (Fig. 1B). 
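
The protein bookkeeping in this section can be checked with a short tally. The sketch below is a consistency check on the numbers stated in the text, not part of the authors' workflow; the dictionary names are ours, and the four PPI identities are the tentative assignments given in the text.

```python
# Protein counts per group, as listed in the text for the 47-protein model.
PROTEINS_PER_GROUP = {
    "U5 snRNP": 11,
    "U2 snRNP": 9,
    "NTC": 7,
    "NTR": 6,
    "EJC": 4,
    "splicing factors": 5,
    "PPIs": 4,
    "Prp16": 1,
}

# Components present in the human but not the yeast C complex structure.
# The PPI identities are only tentatively assigned in the text.
HUMAN_ONLY = {
    "EJC": ["eIF4AIII", "MAGOH", "Y14", "MLN51"],
    "U5 snRNP": ["U5-40K"],
    "NTR": ["Aquarius", "RBM22"],
    "PPIs": ["PPIL1", "CypE", "PPIG", "PPWD1"],
}

total_proteins = sum(PROTEINS_PER_GROUP.values())
human_only_count = sum(len(names) for names in HUMAN_ONLY.values())

print(f"proteins in the model: {total_proteins}")       # matches the 47 stated in the text
print(f"human-only components: {human_only_count}")      # matches the 11 stated in the text
```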

Previously unknown structural features have also been
observed for a number of the human proteins that have functional
orthologs in S. cerevisiae. Most notably, CCDC49 and
CCDC94 display several structural elements that were not
recognized in their S. cerevisiae counterparts Cwc25 and Yju2.
These structural elements play
a crucial role in connecting the active site RNA elements to
Prp16. Because the human and yeast C complexes share
nearly identical structural features in the core of the spliceo- 
some, we focus our discussion on the new structural elements 
and the human-unique proteins. 


The RNA map and the active site 

The RNA sequences in the core regions of the human C com- 
plex display clear features in the EM density map (fig. S5). Of 
the 414 modeled nucleotides, 174, 84, and 97 are assigned to 
U2, U5, and U6 snRNA, respectively (table S2). The remaining 
59 nucleotides belong to pre-mRNA, with 13 nucleotides in 
the 5’-exon and 46 in the intron lariat-3’-exon intermediate. 
The overall structure of the three snRNAs, the 5’-exon, and 
the intron lariat-3’-exon intermediate in the human C complex
is quite similar to that of the yeast C complex (10, 11)
(Fig. 2, A and B). 
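
The nucleotide counts quoted above are internally consistent, as this small Python check shows. The variable names are ours; all numbers come from the text.

```python
# Modeled nucleotides per RNA, as stated in the text.
SNRNA_NT = {"U2": 174, "U5": 84, "U6": 97}
PRE_MRNA_NT = {"5'-exon": 13, "intron lariat-3'-exon intermediate": 46}

snrna_total = sum(SNRNA_NT.values())
pre_mrna_total = sum(PRE_MRNA_NT.values())
modeled_total = snrna_total + pre_mrna_total

# Share of the modeled RNA contributed by each component.
for name, count in {**SNRNA_NT, **PRE_MRNA_NT}.items():
    print(f"{name:36s} {count:4d} nt  {100 * count / modeled_total:5.1f}%")
print(f"pre-mRNA subtotal: {pre_mrna_total} nt, total modeled: {modeled_total} nt")
```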

The active site of the human C complex comprises the intramolecular
stem loop (ISL) of U6 snRNA, the catalytic triplex
between U2 and U6, the loop I of U5 snRNA, and five
metal ions. Three of the five metals, presumably Mg²⁺, appear
to stabilize the ISL by neutralizing the negative charges of the
RNA backbone phosphates (Fig. 2B, left panel). All three metals
are identically coordinated in the yeast C complex (10, 11)
(Fig. 2B, right panel).

Two catalytic metal ions known as M1 and M2 are thought 
to play indispensable but reciprocal roles in the branching 
reaction and exon ligation (23, 24). Compared to the yeast C 
complex (10), the position of M2 is shifted by approximately
3 Å in the human C complex (Fig. 2B, right panel). Coordination
of M1 in the human C complex is different from that in
the yeast complex, where M1 is specifically bound by the 3’-OH
at the 3’-end of the 5’-exon (10, 11). In the human C complex,
the ribose at the 3’-end of the 5’-exon is flipped by approximately
10 Å away from M1 (Fig. 2C). Consequently, M1
is only coordinated by three ligands: the phosphate of the
nucleophile-containing adenine nucleotide in the branch point
sequence (BPS) and the phosphates of G72 and U74 of U6 
snRNA (Fig. 2B). 


The step I splicing factors and Prp16 

Similar to the yeast C complex (10, 11), the conformations of 
the active site RNA elements in the human C complex are 
sustained by 15 surrounding protein components, particularly
CCDC49 (Cwc25 in S. cerevisiae) and CCDC94 (Yju2 in
S. cerevisiae), the NTC components Isy1 and the N-terminal
fragment of Cdc5, and the RNaseH-like domain of Prp8 (Fig. 
3, A and B, and fig. S6). The ATPase/helicases Brr2 and Prp16 
in the peripheral regions are connected to the core of the 
spliceosome mainly through CCDC49 and CCDC94. 

The structurally identified fragment of CCDC49 comprises
an extended α-helix (N-helix, residues 17-61), which is
capped by an N-terminal plug (residues 2-16) and a C-terminal
hook (residues 62-80) (Fig. 3C). Three Trp residues
(Trp12, Trp24, and Trp72) are invariant among the yeast 
orthologs. The N-terminal plug is placed into the active site 
(Fig. 3, B and D), with a conserved Gly-rich motif (Gly2-Gly3- 
Gly4 in CCDC49) penetrating into a small crevice generated 
by the U2/BPS duplex and the helix Ia of the U2/U6 duplex 
(Fig. 3E and fig. S6). Residues 5-16 of the N-terminal plug in- 
teract with the major groove of the U2/BPS duplex and the 
zinc-binding domain of CCDC94. Trp12 stacks against Phe76 
of CCDC94. The N- and C-terminal ends of the N-helix inter- 
act with the 1585-loop of Prp8 and Prp16, respectively (Fig. 3, 
D and F). Trp24 directly contacts the tip of the 1585-loop. The 
hook associates with the RNaseH-like domain (Fig. 3G). Notably,
the interactions of CCDC49 with Prp16 and the
RNaseH-like domain are unknown in the S. cerevisiae C complex
due to the lack of the C-terminal hook and half of the N-helix
in the structurally identified portion of Cwc25 (10).
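
The segment boundaries given for CCDC49 can be captured in a small lookup, which makes it easy to see where each invariant Trp residue falls. This is an illustrative sketch; the names `CCDC49_SEGMENTS` and `segment_of` are ours, and only the residue ranges and Trp positions come from the text.

```python
from typing import Optional

# Residue ranges (inclusive) of the structurally identified CCDC49 fragment.
CCDC49_SEGMENTS = {
    "N-terminal plug": (2, 16),
    "N-helix": (17, 61),
    "C-terminal hook": (62, 80),
}


def segment_of(residue: int) -> Optional[str]:
    """Return the CCDC49 segment containing a residue, or None if it lies outside."""
    for name, (start, end) in CCDC49_SEGMENTS.items():
        if start <= residue <= end:
            return name
    return None


# The three invariant tryptophans named in the text.
for trp in (12, 24, 72):
    print(f"Trp{trp}: {segment_of(trp)}")
```

Each of the three tryptophans falls in a different segment, consistent with their distinct interaction partners described above.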

Although both CCDC94 and CCDC130 share sequence ho- 
mology with the S. cerevisiae Yju2 (Fig. 4A), the fine features 
of the EM density map are only consistent with CCDC94, not 
CCDC130. The structure of CCDC94 consists of an extended 
N terminus, a Cys4-type zinc-binding motif, and three α-helices
(H1/H2/H3) at the C terminus (Fig. 4B). Similar to
CCDC49, the N terminus is inserted deeply into the active
site, interacting with the 5’-exon, the U2/BPS duplex, Isy1,
and Prp8 (Fig. 4C and fig. S6). The zinc-binding motif contacts
the ISL, the U2/BPS duplex, the N-helix of CCDC49, and
the N-terminal portion of Cdc5 (Fig. 4D and fig. S6). The
α-helices H1/H2/H3 interact with Prp16, Prp8, the N-terminal
portion of Cdc5, and the C-terminal HAT repeats of Syf1 (Fig.
4E). Notably, the interactions involving the CCDC94 helices 
H1/H2/H3 are previously uncharacterized in the S. cerevisiae 
C complex because these helices remain structurally uniden- 
tified in Yju2 (10, 11). 

The elongated helices of CCDC49 and CCDC94 directly as- 
sociate with Prp16 (Fig. 4F). Specifically, the N-helix of 
CCDC49 binds the RecA2 domain of Prp16, whereas the heli- 
ces H2/H3 of CCDC94 interact with the OB and Ratchet-like 
domains. Prp16, in turn, interacts with the Jab1/MPN domain
of Prp8 and the N-terminal Plug (residues 107-179) and the 
PWI-like domain (residues 1030-1181) of Brr2 (fig. S7A). The 
Jab1/MPN domain also stably associates with Brr2 (25). The 
RNA-binding site of Prp16 remains unoccupied (fig. S7, B and 
C), consistent with the structural comparison between Prp16 
of the human C complex and Prp43 bound to RNA (26) (fig. 
S7D). Prp16 is thought to remodel the C complex by pulling 
the 3’-end sequences of the intron lariat-3'-exon intermediate 
(3, 4, 27, 28). In the human C complex, the putative RNA-binding
site of Prp16 is located approximately 60 Å away
from the last ordered nucleotide at the 3’-end of the intron
lariat-3'-exon intermediate (Fig. 4F). At least 9-10 RNA nucleotides
are required to cover the distance, consistent with
the observation that shortening of the 3’-exon led to the inhibition
of exon ligation (4, 29).
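
The distance estimate can be turned around to see what per-nucleotide spacing it implies. The sketch below uses only the two figures from the text (about 60 Å and 9-10 nucleotides); the function name is ours, and no independent value for the spacing of extended single-stranded RNA is assumed.

```python
DISTANCE_ANGSTROM = 60.0  # approximate gap quoted in the text


def spacing_per_nucleotide(distance: float, n_nucleotides: int) -> float:
    """Average spacing each nucleotide must span to bridge the given distance."""
    return distance / n_nucleotides


for n in (9, 10):
    spacing = spacing_per_nucleotide(DISTANCE_ANGSTROM, n)
    print(f"{n} nt -> {spacing:.2f} A per nucleotide")
```

The estimate of 9-10 nucleotides therefore corresponds to an average spacing of roughly 6 to 6.7 Å per nucleotide, that is, to a largely extended RNA strand.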


Peptidyl prolyl isomerases (PPIs) 

PPIs are absent in the structure of the S. cerevisiae C complex
(10, 11). Eight PPIs are known to be present in the human 
spliceosome (30) (fig. S8 and table S3), of which four have 
been identified in the structure of the human C complex. 
CypE contains an RNA recognition motif (RRM, residues 6- 
99) and a canonical PPI domain (residues 108-266) (fig. S8). 
In the human C complex, both the RRM and the PPI of CypE 
associate with the intron lariat (Fig. 5A). The RRM also binds 
the U2 snRNP component U2-A’ (Lea1 in yeast) and the superhelical
protein Syf1; the PPI interacts with U2-A’, Isy1, and
the heptameric U2 Sm ring. The C-terminal sequences of SmB 
contain a conserved motif GLXGPVRGVGGP (Fig. 5A), which 
is spatially close to the PPI domain of CypE and may serve as 
a substrate. 
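
A motif such as GLXGPVRGVGGP, where X stands for any residue, maps directly onto a regular expression. The following Python sketch shows one way to scan a protein sequence for it; the function name and the test sequences are hypothetical and are not real SmB sequences.

```python
import re

# "X" in the motif is a wildcard position, written as "." in the regex.
SMB_MOTIF = re.compile(r"GL.GPVRGVGGP")


def find_motif(sequence: str) -> list:
    """Return the 0-based start positions of every motif occurrence."""
    return [match.start() for match in SMB_MOTIF.finditer(sequence)]


# Hypothetical toy sequences for illustration only.
toy_with_motif = "MKKAGLAGPVRGVGGPSQ"
toy_without_motif = "MKKAGLPGAVRGVGGPSQ"

print(find_motif(toy_with_motif))
print(find_motif(toy_without_motif))
```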

PPIL1 (Cyp1 in S. pombe) closely interacts with the ex- 
tended sequences of both SKIP (Prp45 in yeast) (31, 32) and 
the step II factor Prp17 (Fig. 5B). PPIL1 also binds both the N- 
and C-domains of RBM22. The PPIL1-bound sequences of
Prp17 contain the motif NPX;PXXGP, which likely represents 
the substrate. The remaining two PPIs associate with the 
RNaseH-like domain or the RT Finger/Palm domain of Prp8.
One PPI is located close to the C-terminal sequences of 
CCDC49, which harbor the sequence motif FAPX;;DP but 
lack a conserved Gly residue that is usually required for sub- 
strate specificity (33, 34) (Fig. 5C). The other PPI may catalyze 
proline isomerization of Ad-002 (Cwc15 in yeast) (Fig. 5D).
These two PPIs may represent PPWD1 and PPIG, although
the current EM density map does not allow conclusive assignment.

All four PPIs bind the extended sequence elements of the 
protein components in the human spliceosome. The switch 
of the proline cis-trans configuration usually results in alter- 
ation of the local structure, which is known to trigger signal- 
ing and impact on a range of important cellular processes (34, 
35). These PPIs are thought to join spliceosomal complexes at 
distinct stages of the splicing reaction to regulate pre-mRNA 
splicing (30, 36). The structural changes caused by proline 
isomerization likely lead to alteration of the interactions be- 
tween the substrate protein and surrounding components of 
the spliceosome. 


Structural comparison with the human C* complex 

Structural resolution of the human C complex allows comparison
with the human C* complex (19, 20) and visualization of
the compositional and conformational changes. These
changes are concentrated in two aspects. First, the protein
components at the peripheral regions on one side of the
spliceosome have undergone major reshuffling, exemplified
by the large-scale movement of the U2 snRNP (Fig. 6A, left
panels). The overall translocation of the U2 snRNP exceeds
100 Å (fig. S9 and movies S1 and S2). The RRM domain of
CypE binds the intron sequences upstream of the BPS in the
human C complex but instead interacts with the helix IIa/b
of U2 snRNA in the human C* complex (19) (fig. S9). In contrast,
the RRM domain binds Syf1 identically in both the C
and C* complexes. Such changes may be facilitated by proline
isomerization in the U2 snRNP component SmB. During the
C-to-C* transition, the protein PRKRIP1 is recruited into the
C* complex to stabilize the new position of the U2 snRNP.
Brr2 is translocated by about 150 Å in the C-to-C* transition
(Fig. 6A, middle panels, and fig. S10). Prp16 is dissociated in
the transition and Prp22 is recruited into the C* complex (9).
Second, the protein components at or near the active site exhibit
major changes. During the transition, CCDC49 and
CCDC94, along with Isy1, are dissociated (Fig. 6A, right panels).
The step II splicing factor Slu7 is recruited into the C*
complex; Prp17 and the RNaseH-like domain of Prp8 undergo
major positional changes (movie S3). During the C-to-C* transition,
the WD40 domain of Prp17 is translocated by approximately
70 Å, the RNaseH-like domain of Prp8 is rotated by
about 70 degrees, and the β-finger is inserted into the gap
between the U6/5'SS duplex and the U2/BPS duplex (fig. S10).


Discussion 

Our structural analysis reveals a working model for the C-to- 
C* transition (Fig. 6B). In the C complex, the active site RNA 
elements are stabilized by CCDC49 and CCDC94 together 
with Isy1, Prp17, and the RNaseH-like domain of Prp8. The
ATPase/helicase Prp16 is framed adjacent to the active site
mainly through interactions with the two step I factors, and
Brr2 binds to the other side of Prp16 from the periphery of 
the spliceosome. Given their direct interactions with the 
U2/BPS at the active site, CCDC49 and CCDC94 may be im- 
mediately dissociated by the action of Prp16 (27). This is likely 
followed by the dissociation of Prp16 because it is bound to 
the spliceosome mainly through the two step I factors. These 
changes allow the recruitment of Slu7, PRKRIP1, and the
ATPase/helicase Prp22, ultimately forming the C* complex. 
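
The compositional side of the working model amounts to a set difference between the two complexes. The sketch below encodes only the factors named in the main text as exchanged or repositioned (the Fig. 6 caption additionally lists Prp18 among the recruited step II factors); the set names are ours and the sets are deliberately partial, not full inventories of either complex.

```python
# Factors named in the text as present only in the C complex.
C_ONLY = {"Prp16", "CCDC49", "CCDC94", "Isy1"}
# Factors named in the text as recruited into the C* complex.
C_STAR_ONLY = {"Slu7", "PRKRIP1", "Prp22"}
# Factors described as repositioned rather than exchanged (partial list).
REPOSITIONED = {"Brr2", "Prp17", "Prp8"}

c_complex = REPOSITIONED | C_ONLY
c_star_complex = REPOSITIONED | C_STAR_ONLY

dissociated = c_complex - c_star_complex
recruited = c_star_complex - c_complex
retained = c_complex & c_star_complex

print("dissociated:", sorted(dissociated))
print("recruited:  ", sorted(recruited))
print("retained:   ", sorted(retained))
```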

The step I factors CCDC49 and CCDC94 stabilize the ac- 
tive site RNA conformation to prime the branching reaction. 
This structural finding is fully consistent with available bio- 
chemical and genetic observations (37-39). The structure of 
the human C complex also reveals notable features of
CCDC49 and CCDC94 that were previously unknown in the
S. cerevisiae C complex. These structural features (the elongated
N-helix and the hook in CCDC49, and the three α-helices
H1/H2/H3 in CCDC94) directly interact with the ATPase/helicase Prp16
and the central protein Prp8 (Figs. 3 and 4), linking Prp16 to
the active site RNA elements.

Structural analysis suggests a key role for the 1585-loop in 
the upload of the 3’-exon into the active site (fig. S11). In the 
human C-to-C* transition, the location of the 1585-loop flips 
around the sequence element at the 3’-side of the lariat junction.
In the yeast P complex (14-16), the 1585-loop is located
in a position that resembles that in the human C* complex 
(19) but is sandwiched by the 3’SS of the intron lariat and the 
ligated exon (fig. S11). During these processes, the confor- 
mation of the 1585-loop undergoes drastic changes. 

In this study, the splicing inhibitor BN82685 was used to 
enrich the human C complex. The EM density map in the core 
of the spliceosome does not support the presence of the in- 
hibitor. Although direct binding of BN82685 cannot be ruled 
out, a more likely scenario is that the inhibitor regulates the 
kinase or phosphatase activity that is essential for pre-mRNA 
splicing (21). The structure of the human C complex contains 11
additional proteins compared to that of the S. cerevisiae C
complex (10, 11). The majority of these additional proteins in
the human spliceosome play a regulatory role and attest to
the more sophisticated nature of pre-mRNA splicing
in mammals.

Fig. 1. Cryo-EM structure of the human catalytic step I spliceosome (C complex) at 4.1 Å resolution.
(A) Overall structure of the human C complex. Two perpendicular views are shown. The atomic model 
includes 47 proteins, three snRNAs (U2, U5, and U6), a 5'-exon, and a 44-nucleotide intron lariat, with a 
molecular weight of approximately 1.8 MDa. Among the modeled 15,479 amino acids, about two-thirds 
have side chains. All structural images in the figures were created using PyMol (40). (B) An overall 
structural comparison between the C complexes from human (left) and S. cerevisiae (right). In both 
structures, the U2, U5, and U6 snRNAs are colored marine, orange, and green, respectively; the pre-mRNA 
is highlighted in red; and the shared protein components are shaded gray. Protein components that are 
uniquely present in either structure are color-coded. Compared to the structure of the yeast C complex, 
11 additional protein components have been identified in the human C complex. These include 4 peptidyl
prolyl isomerases (PPIs), 4 proteins of the exon junction complex (EJC) (eIF4AIII, MAGOH, Y14, and
MLN51), U5-40K, RBM22, and Aquarius. The EM density map also allows unambiguous identification of the
ATPase/helicases Prp16 and Brr2, which were absent in our own preparation of the yeast C complex (10).
Notably, the human protein RBM22 appears to be derived from the yeast Ecm2 and Cwc2 (22). 

Fig. 2. The RNA elements and the active site metals in the human C complex. (A) Overall
structure of the core RNA elements. The color-coded RNA structure of the human C complex is 
shown in the left panel, and its comparison to that of the S. cerevisiae C complex (colored gray) is 
displayed in the right panel. In the C complex, the 5'-exon remains anchored to the loop I of U5
snRNA. The nucleophile-containing adenine nucleotide from the branch point sequence (BPS) is 
covalently joined to the guanine nucleotide at the 5'-end of 5'SS. The disordered RNA sequences of 
the intron lariat are indicated by the dotted lines. (B) Coordination of the metal ions in the active 
site of the human C complex (left panel) and its comparison to the S. cerevisiae C complex (right 
panel). All structural elements of the S. cerevisiae C complex are shown in gray. Both the location 
and the coordination of the three structural metals (colored magenta) in the active site of the 
human C complex are nearly identical to those in the S. cerevisiae C complex. However, despite a 
similar location for the catalytic M1 metal (colored red) in the human C complex, its coordination 
has changed compared to the yeast C complex. In addition, compared to the yeast C complex, the 
catalytic M2 in the human complex has been translocated by about 3 Å. (C) A close-up view on the
coordination of the M1 metal. Due to a positional shift of the ribose of the guanine nucleotide (G-1)
at the 3'-end of the 5'-exon, the M1 metal is coordinated by the 3'-OH of G-1 only in the S. cerevisiae
C complex, but not in the human C complex. 

Fig. 3. Structure and functional implications of the step I splicing factor CCDC49 in the
human C complex. (A) An overall view on the protein components surrounding the active site. 
The protein components are color-coded. The ATPase/helicase Prp16 is placed between Brr2
and the core of the spliceosome. The step I splicing factors CCDC49 and CCDC94 correspond to
the yeast proteins Cwc25 and Yju2, respectively. (B) A surface view on the key proteins that 
stabilize the active site conformation. The N-termini of both CCDC49 and CCDC94 insert deeply 
into the active site to interact with the RNA elements. The RNaseH-like domain of Prp8, the NTC 
component Isy1, and the N-terminal portion of Cdc5 also stabilize the RNA elements at the active 
site. (C) Domain structure of CCDC49 and its sequence alignment with Cwf25 (S. pombe) and 
Cwc25 (S. cerevisiae). Three invariant Trp residues of CCDC49 — Trp12 in the N-terminal plug, 
Trp24 in the N-helix, and Trp72 in the C-terminal hook — interact with CCDC94, the 1585-loop of 
Prp8, and the RNaseH-like domain of Prp8, respectively. (D) A close-up view on CCDC49. The 
three invariant Trp residues are indicated. (E) A close-up view on the N-terminal plug of CCDC49 
and its interactions with surrounding components. The N-terminal Gly2-Gly3-Gly4 motif is 
inserted deeply between the U2/BPS duplex and the U2/U6 duplex at the active site. Trp12 of
CCDC49 interacts with hydrophobic amino acids from CCDC94. (F) A close-up view on the N- 
terminal portion of the N-helix. Trp24 of CCDC49 directly contacts the 1585-loop of Prp8. (G) A 
close-up view on the C-terminal hook of CCDC49. Shown here is the EM density map in this 
region. The C-terminal portion of the N-helix interacts with Prp16 whereas the hook binds to the 
RNaseH-like domain of Prp8, with Trp72 of CCDC49 playing a critical role. 

Fig. 4. The step I splicing factor CCDC94 and the ATPase/helicase Prp16 in the human C complex.
(A) Sequence alignment involving the N-terminal fragment and the zinc-binding domain of CCDC94. The 
corresponding sequences from CCDC94 and CCDC130 (human), Cwf16 and Saf4 (S. pombe), and Yju2
(S. cerevisiae) are aligned. Compared to CCDC130 and Saf4, CCDC94 and Cwf16 share a higher degree
of sequence similarity with Yju2. (B) The structure of CCDC94 is shown below its schematic domain 
representation. CCDC94 sequentially consists of an N-terminal fragment, a Cys4-type zinc-binding motif, 
and three a-helices known as H1/H2/H3. The protein and RNA components that interact with these 
domains are indicated below the schematic domain representation. (C) The N terminus of CCDC94 
inserts deeply into the active site, making direct interactions with Isy1, Prp8, U2/BPS duplex, and 5'SS.
(D) The zinc-binding motif binds the N terminus of CCDC49, U6 snRNA and the U2/BPS duplex. (E) The
helix H2 interacts with Prp16, Prp8 and the N-terminal domain of Cdc5, whereas the helix H3 binds to the
N-terminal domain of Cdc5 and the C-terminal region of Syf1. (F) Prp16 is connected to the active site
RNA elements through its interactions with the two step I factors CCDC49 and CCDC94. Two
perpendicular views are shown. Prp16 is thought to pull the 3'-exon sequences in a 3'-to-5' direction,
dissociating CCDC49 and CCDC94 and eventually forming the step II activated spliceosome (the C* 
complex). 

Fig. 5. The location and putative mechanism of the peptidyl prolyl isomerases (PPIs) in the
human C complex. (A) CypE associates with the intron lariat through both the N-terminal RRM
domain and the C-terminal PPI domain. The PPI domain of CypE may catalyze proline isomerization in
the C-terminal fragment of the SmB protein. Sequence alignment of SmB from five organisms is
shown below the structure. Conserved residues are boxed and invariant residues are shaded red. (B)
PPIL1 associates with SKIP, Prp17, and RBM22. Sequence alignment of Prp17 from five organisms is
shown below the structure. Prp17 may serve as the substrate for PPIL1. (C) A PPI domain associates
with the RNaseH-like domain of Prp8. Sequence alignment of CCDC49 from five organisms is shown
below the structure. This PPI might come from PPWD1. (D) A PPI domain interacts with the RT
Finger/Palm domain of Prp8. The C-terminal sequences of Ad-002 may serve as the substrate for this
PPI, which is tentatively assigned to PPIG.

Fig. 6. A working model on the transition from the human C complex to the C* complex.
(A) Structural comparison between the human C and C* complexes. Three surface views of 
the human C and C* complexes are shown in the upper and lower panels, respectively. The 
protein components that are unique to either complex or undergo marked positional 
movement are color-coded. (B) A working model on the transition from the human C complex 
to the C* complex. An overall structural comparison between the human C and C* complexes 
is shown in the upper panels. The protein components that remain unchanged in the transition 
are removed to better display the key players. A cartoon diagram of the working model is 
shown in the lower panels. In this model, the transition is triggered by Prp16, which, through
ATP binding and hydrolysis, pulls the 3'-end single-stranded sequences of the intron lariat-3'-exon
intermediate. This leads to the dissociation of the step I factors CCDC49 and CCDC94
and recruitment of step II factors Prp18 and Slu7 and the protein PRKRIP1. Prp16 is dissociated
presumably due to the loss of interactions with the step | factors and Prp22 is recruited into 
the C* complex. 

Oligosaccharyltransferase (OST) is an essential membrane protein complex in the Endoplasmic
Reticulum, where it transfers an oligosaccharide from a dolichol-pyrophosphate-activated donor to
glycosylation sites of secretory proteins. We here describe the atomic structure of yeast OST determined
by cryo-EM, revealing a conserved subunit arrangement. The active site of the catalytic STT3 subunit
points away from the center of the complex, allowing unhindered access to substrates. The dolichol-pyrophosphate
moiety binds to a lipid-exposed groove of STT3, while two non-catalytic subunits and an
ordered N-glycan form a membrane-proximal pocket for the oligosaccharide. The acceptor polypeptide
site faces an oxidoreductase domain in standalone OST complexes or is immediately adjacent to the
translocon, suggesting how eukaryotic OSTs efficiently glycosylate a large number of polypeptides prior
to their folding.


N-linked glycosylation is a post-translational modification of 
asparagine residues found in all domains of life (1). The covalently
attached glycans are essential for correct protein folding,
sorting and secretion, or for modulating specific cell
surface interactions (2-4). The central enzyme in the N-gly- 
cosylation pathway is the oligosaccharyltransferase (OST), 
which catalyzes the initial transfer of a defined glycan
(Glc3Man9GlcNAc2 in higher eukaryotes) from the lipid carrier
dolichol-pyrophosphate to polypeptide chains entering
the secretory pathway in the Endoplasmic Reticulum (ER) (5, 
6). OST specifically recognizes the Asn-Xaa-Ser/Thr motif, 
where Xaa can be any amino acid except proline (7). Whereas 
in bacteria and some lower eukaryotes, this process is carried 
out by a single-subunit oligosaccharyltransferase (ssOST) (8- 
10), most eukaryotes encode large, multi-subunit, membrane- 
embedded OST complexes that contain eight subunits in 
yeast and possibly more in multicellular organisms (11, 12). 
In Saccharomyces cerevisiae five subunits (STT3, SWP1,
WBP1, OST1 and OST2) were shown to be essential for cell
survival, while deletion of the remaining three subunits was 
found to reduce complex stability and glycosylation activity 
(11, 13). 

Eukaryotic OST enzymes have been visualized using sin- 
gle particle cryo-electron microscopy or cryo-electron tomog- 
raphy (14-16), but the resolution of these studies, ranging 
from 20 to 9 Å, did not allow unambiguous assignment of OST
subunits. X-ray structures of ssOSTs from bacteria and ar- 
chaea (PglB and AglB proteins, homologous to the catalytic 
STT3 protein of multi-subunit OSTs) have provided insight 
into the substrate recognition and glycan transfer mechanisms
of prokaryotic enzymes (8, 9, 17, 18). X-ray structures
of the isolated luminal domains of yeast OST6 and of the hu- 
man Tusc3 (also named N33) have revealed the disulfide for- 
mation and cleavage mechanism of this redox chaperone (19, 
20). However, in the absence of a high-resolution structure of 
a eukaryotic OST complex, it is unknown how the non-cata- 
lytic subunits extend the range of acceptor polypeptides over 
that of ssOST enzymes. To reveal the architecture of eukaryotic
OSTs and to understand how they recognize and process
a striking number of acceptor proteins (21), we determined a
high-resolution structure of the yeast OST complex using sin- 
gle-particle cryo-EM. 


Preparation of OST complex for cryo-EM analysis 

Yeast OST exists in two isoforms, containing either OST3 
or the homologous OST6 subunit. To avoid potential hetero- 
geneity in our purified samples, we generated a S. cerevisiae 
strain lacking the OST6 gene and overexpressed OST3 from a 
plasmid to compensate for the absence of OST6. A 1D4-tag 
fused to the C terminus of the OST4 subunit facilitated effi- 
cient affinity purification and sufficient yields of the OST 
complex despite low endogenous expression levels. The resulting
strain (Δost6 pOST3 OST4-1D4) showed almost complete
N-glycosylation of OST1 and WBP1 proteins, whereas a
strain expressing only an OST6-containing complex (Δost3
pOST6 OST4-1D4) exhibited hypoglycosylation (Fig. 1A).
SILAC-based mass spectrometric analyses demonstrated that 
for 6 out of 8 subunits, the detergent-purified, OST3-containing
complex has the same subunit composition and
stoichiometry as OST in wild-type yeast cells (Fig. 1B and ta- 
ble S1). OST4-derived peptides could not be detected, and the 
observed ratios for OST3 and OST6 showed that the two pa- 
ralogs are mutually exclusive in the assembled OST complex. 
The OST3-containing complex reconstituted into lipidic 
nanodiscs was fully functional as shown by an in vitro glycosylation
assay developed earlier (10, 22), using a synthetic lipid-linked
oligosaccharide (LLO) analog and a fluorescently
labeled acceptor peptide (Fig. 1C). The measured glycosyla- 
tion activity of 3.5 peptides per minute per OST matches pre- 
viously reported rates of a eukaryotic ssOST (10) (Fig. 1D). 

The resolution of the 3D reconstruction of nanodisc-reconstituted
OST was 3.3 Å based on the FSC = 0.143 criterion
(figs. S1 and S2). The EM map was of excellent quality in the
transmembrane (TM) region and in most of the luminal re- 
gions. The luminal domain of SWP1, most distant to the mem- 
brane, featured lower map quality, probably indicating 
higher domain flexibility, but still displayed secondary structure
features (Fig. 2A and figs. S1, D to F, and S3). Missing
parts in the model include the luminal domain and TM1 of 
OST3 and the external loop EL5 of STT3, for which no density 
was observed. Because we visualized OST in an apo state, 
these segments are likely mobile (see below). 
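
As a brief illustration of how a resolution value is read off a Fourier shell correlation (FSC) curve at the 0.143 threshold, the following minimal Python sketch interpolates the first crossing of the threshold. The function name and the example curve are hypothetical and are not taken from this study; processing packages may use different interpolation details.

```python
def resolution_at_threshold(spatial_freqs, fsc_values, threshold=0.143):
    """Return the resolution (in Å) where the FSC first drops below threshold.

    spatial_freqs: increasing spatial frequencies in 1/Å.
    fsc_values: FSC value at each spatial frequency.
    Linear interpolation is used between the two bracketing shells.
    """
    points = list(zip(spatial_freqs, fsc_values))
    for (f0, c0), (f1, c1) in zip(points, points[1:]):
        if c0 >= threshold > c1:
            # Frequency at which the interpolated FSC equals the threshold.
            f_cross = f0 + (c0 - threshold) * (f1 - f0) / (c0 - c1)
            return 1.0 / f_cross
    return None  # the curve never crosses the threshold


# Hypothetical FSC curve (illustrative numbers only, not data from the paper).
freqs = [0.10, 0.20, 0.25, 0.30, 0.35]
fsc = [0.99, 0.90, 0.60, 0.20, 0.05]
resolution = resolution_at_threshold(freqs, fsc)
print(f"Estimated resolution: {resolution:.2f} Å")
```

The reported resolution is the reciprocal of the spatial frequency at the crossing, so a curve that stays high out to finer shells yields a smaller value in Å.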


Architecture and subunit structure 

The membrane-embedded part of yeast OST contains a 
total of 28 transmembrane helices, with each subunit contrib- 
uting at least one TM segment (Fig. 2, B to D). The membrane 
topologies of the subunits agree with previous predictions 
(11) except for OST5, whose N- and C-termini are located in 
the ER lumen. Previous in vivo experiments suggested that 
OST assembly occurs through the formation of three sub- 
complexes (13). Our structure revealed that the spatial ar- 
rangement of the OST subunits agrees with this subdivision: 
Sub-complex I contains OST1 and OST5, sub-complex II contains
STT3, OST3, and OST4, and sub-complex III contains
OST2, WBP1, and SWP1 (Fig. 2D). At the membrane-embedded
interfaces between sub-complexes, several ordered phospholipids
could be identified, in particular in the lipid layer
facing the ER lumen (fig. S4, F to I). 

In sub-complex I, the N-terminal helix of OST5 is in close 
contact with the OST1 subunit. OST1 has a single TM helix 
and a luminal domain containing two sub-domains that have 
similar folds and superimpose well on each other (r.m.s.d. =
2.2 Å for 153 out of 191 residues, fig. S5A). The fold features
two stacked β-sheets and no α-helices and was previously observed
in other multi-domain proteins, e.g., in aminopeptidases
and leukotriene hydrolases (23, 24) (fig. S5B). Because
it does not contain the catalytic residues in these proteins, its 
function within OST1 cannot be deduced. 

Sub-complex II includes the catalytic subunit STT3, 
which is homologous to the functionally and structurally related
ssOST enzymes PglB from Campylobacter lari and AglB
from Archaeoglobus fulgidus (8, 9, 18). Our structure con- 
firms that yeast STT3 contains a similar transmembrane to- 
pology with 13 TM helices (Fig. 2C) (25, 26), of which TM9 is 
poorly resolved in the EM map and is probably flexible in the 
absence of bound substrate (fig. S6). The luminal domain of 
STT3 also resembles the PglB and AglB proteins, although the
folds are not identical (fig. S7). Importantly, we found the ex- 
ternal loop EL5 of STT3 to be disordered, in line with previ- 
ous observations that EL5 of PglB only becomes fully ordered 
when substrates are bound (18). The TM helices of STT3 are
tightly interacting with OST4 (which consists of a single TM 
helix) and with TM2-TM4 of OST3 (Fig. 2C). Although the 
presence of full-length OST3 in the OST complex was con- 
firmed by SDS-PAGE and mass spectrometric analysis (Fig. 
1A and table S1), no clear density for TM1 and the N-terminal 
luminal domain of OST3 was visible in the EM map. We con- 
clude that in the absence of a peptide substrate, the luminal 
domain of OST3 is highly flexible. Both TM1 of OST3 and TM9 
of STT3 are in close proximity to the likely LLO-binding site, 
suggesting that their flexibility might be associated with the 
absence of bound LLO. 

Sub-complex III contains the OST2 subunit, whose TM 
helices mediate contacts between STT3 and the TM helices of 
WBP1 and SWP1. OST2 contains an N-terminal α-helix (aa21-aa38)
that is located at the cytosolic membrane boundary and
parallel to the membrane plane, where it forms contacts to 
TM8 and TM9 of STT3. The WBP1 subunit contains two lu- 
minal domains. The N-terminal domain shares structural ho- 
mology with various proteins of distinct functions including 
an intraflagellar transport protein or glutamine amidotrans- 
ferase (27, 28) (fig. S5C), whereas the central domain of WBP1 
features a fold found in proteins of the complement system 
(29, 30) (fig. S5D). However, none of these structural homo- 
logs unambiguously identify a potential function of WBP1. 
The SWP1 subunit contains a single luminal domain, which 
is most distant to the membrane of all OST domains and con- 
nected to a single transmembrane helix TM1 by a long linker. 

OST1, WBP1 and SWP1 were previously suggested to be 
involved in substrate recognition and to act as chaperones, 
which would coordinate protein folding and glycosylation 
(31-33). Our structural data show that none of their luminal 
domains adopt a chaperone-like fold. However, these do- 
mains might serve as docking platforms for interaction part- 
ners including chaperones or enzymes acting on nascent 
glycoproteins. For example, the human OST1 homolog Ribo- 
phorin I was shown to interact with the carbohydrate-bind- 
ing protein malectin (34, 35). 

To analyze whether eukaryotic OST complexes share 
common structural features we plotted the degree of se- 
quence conservation between different species onto our yeast 
OST structure. We found that the active site groove in STT3
is highly conserved (Fig. 3A and fig. S7D). In addition, regions 
of high sequence conservation were found at the interfaces 
between subunits OST1 and STT3 (1129 Å² buried surface
area) as well as between WBP1 and SWP1 (2369 Å² buried surface
area) (Fig. 3B). This suggests that both the arrangement
of the luminal domains and the active site are conserved in 
eukaryotic, multi-subunit OST. 

Previous genetic and mass spectrometric analyses sug- 
gested that yeast OST contains seven N-glycosylation sites 
(36). We observed EM density for five of these glycans (at 
N539 of STT3, N336 and N400 of OST1, N60 and N332 of
WBP1), whereas the remaining two glycans (at N99 and N217 
of OST1) are located in flexible loop regions and are disor- 
dered. The best-ordered N-glycan is attached to the strictly 
conserved STT3 residue N539, where we observed density 
covering eight glycan moieties (Man6GlcNAc2). This N-glycan
forms interactions with WBP1 and SWP1 (fig. S4, A to E) and
is in the immediate vicinity of the proposed binding site for 
the 14-unit-saccharide of bound LLO (see below). 


Active site and substrate-binding pockets 

The STT3 subunit contains the active site and the accep- 
tor peptide and donor LLO binding pockets. Although we vis- 
ualized yeast OST in the apo state with no substrates bound, 
significant mechanistic insight could be deduced by compar- 
ing our structure to bacterial and archaeal ssOSTs (8, 9, 37). 
A superposition of yeast STT3 and substrate-bound PglB (18)
(fig. S7, A and B) revealed that functionally important, con- 
served residues and motifs have a similar spatial arrange- 
ment (Fig. 4A). These include the D-X-D/E motif involved in 
coordinating the catalytic metal ion, an aspartate (D47 in 
yeast STT3 corresponding to D56 in PglB) that binds both the
metal ion and the carboxamide group of the acceptor asparagine,
the WWD motif providing hydrogen bond contacts to
the β-hydroxyl group of the +2 serine/threonine of the acceptor
sequon, and the K586 residue of the so-called “DK motif”
that contributes additional contacts to the +2 serine/threonine
of the acceptor peptide (Fig. 4A). Residues shown to be
involved in LLO binding to PglB (18) are also conserved.
These include an essential arginine (R404 in yeast STT3) that 
interacts with the pyrophosphate group of LLO, and a tyro- 
sine residue (Y521 in yeast STT3) that forms a hydrogen bond 
with the N-acetyl group of the reducing-end GlcNAc moiety 
(Fig. 4B). 

Comparing the electrostatic surface potential maps of the 
peptide-binding pocket of PglB with the predicted binding 
pocket of yeast STT3 provides a molecular explanation for 
some of the differences in acceptor peptide specificities of 
bacterial and eukaryotic OSTs (Fig. 4C) (38, 39). In PglB, an 
arginine residue (R331) was found to interact with a nega- 
tively charged side chain (Asp or Glu) at the -2 position of the 
sequon. This arginine is conserved in bacterial ssOST enzymes
and correlates with an extended sequon requirement
(DxNxS/T) for bacterial N-glycosylation (8, 38) (Fig. 4C). In 
contrast, no such requirement is present at the -2 position of 
acceptor sequons in eukaryotes, and no positively charged 
residue is present in yeast STT3 where the -2 side chain is 
expected to bind (Fig. 4C). Instead, a larger cavity providing 
space for more voluminous side chains at the -2 position of 
sequons is observed, which is in line with previous findings 
that human OSTs display an increased glycosylation effi- 
ciency for substrates with aromatic residues at the -2 position 
(39). At the expected binding site of the -1 residue of the se- 
quon, we found a “knob” in the yeast STT3 structure formed 
by the side chain of E45 (Fig. 4C). The reduced space may 
result in a more efficient binding of sequons with smaller side 
chains at the -1 position. 
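
The difference between the eukaryotic sequon (N-x-S/T, x not proline) and the extended bacterial sequon (D/E-x-N-x-S/T) can be made concrete with a short Python sketch. The pattern and function names are hypothetical. Leaving the -1 position unconstrained in the bacterial pattern is an assumption, as the text only specifies the -2 and +1 positions. The test peptides DANYTK and DQNATF are those named in the legends of Fig. 1 and Fig. 4.

```python
import re

# Eukaryotic sequon: Asn, any residue except Pro, then Ser or Thr.
# A lookahead is used so that overlapping sequons are all reported.
EUKARYOTIC_SEQUON = re.compile(r"(?=N[^P][ST])")

# Extended bacterial sequon: Asp or Glu at -2, then any residue at -1
# (assumption: -1 is left unconstrained), followed by N-x-S/T.
BACTERIAL_SEQUON = re.compile(r"(?=[DE].N[^P][ST])")


def eukaryotic_sites(seq):
    """Return 0-based indices of acceptor Asn residues in N-x-S/T sequons."""
    return [m.start() for m in EUKARYOTIC_SEQUON.finditer(seq)]


def bacterial_sites(seq):
    """Return 0-based indices of acceptor Asn residues in D/E-x-N-x-S/T sequons."""
    return [m.start() + 2 for m in BACTERIAL_SEQUON.finditer(seq)]


for peptide in ("DANYTK", "DQNATF", "AQNATF", "NPT"):
    print(peptide, eukaryotic_sites(peptide), bacterial_sites(peptide))
```

In this sketch, a peptide without an acidic residue at -2 (AQNATF) still counts as a eukaryotic sequon but not as a bacterial one, mirroring the specificity difference discussed above.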

A groove between TM6 and TM11 of STT3 indicates 
where the dolichol tail of the LLO will likely bind (Fig. 3A). A 
similar groove was demonstrated to represent the polyprenyl-binding
site in PglB, with four prenyl units ordered in the X-ray
structure (18). Given their locations, it is conceivable that
in yeast OST, the partially ordered TM9 of STT3 and the dis- 
ordered TM1 of OST3 might interact with bound LLO. Our 
results suggest that PglB and yeast STT3 share a common 
substrate recognition and glycan transfer mechanism despite 
differing in their substrate specificities. 

Because glycosylation is an essential process in yeast, 
abolishing OST function prevents cell growth (40). To vali- 
date the importance of the residues identified in the yeast 
OST structure, we therefore used a tester strain that ex- 
pressed the ssOST enzyme LmSTT3D from Leishmania major 
(41) and generated six chromosomal mutations in the STT3 
locus (D47A, D166A, E168Q, E350A, R404A, K586A). These 
mutations were predicted to affect metal binding, peptide 
binding, or LLO binding, but did not affect OST complex sta- 
bility (fig. S7C). With the exception of K586A, which resulted 
in a temperature-sensitive phenotype, all generated muta- 
tions prevented growth in the absence of LmSTT3D (Fig. 4D). 
We conclude that these mutations impaired the catalytic ac- 
tivity of the STT3 subunit and thus of OST function in vivo 
(40). 


Acceptor polypeptide delivery by the redox chaperone
OST3/6 or the translocon

Higher eukaryotes express two paralogs of the catalytic 
STT3 subunit, termed STT3A and STT3B, which are part of 
distinct OST complexes. STT3A-containing OST associates 
with the translocon, thereby bringing native peptide chains 
entering the ER into close proximity of the glycosylation ma- 
chinery (14, 15, 42-46). In contrast, STT3B-containing OST 
complexes (including the yeast enzyme) are standalone units 
that contain either an OST3 or OST6 subunit (homologous to 
the mammalian Tusc3 and MagT1, respectively), neither of
which is present in translocon-associated OST. The luminal 
domains of OST6 and Tusc3 have oxidoreductase activity and 
feature a thioredoxin fold. Both can form disulfide bonds 
with acceptor proteins, but OST3- or OST6-containing com- 
plexes process different subsets of disulfide-forming acceptor 
polypeptides in vivo (19, 47, 48). In our apo structure of yeast 
OST, the oxidoreductase domain of OST3 appears disordered 
and is likely flexible, which may allow diverse and transiently 
bound OST substrates to be efficiently glycosylated. 

Recent studies reported a mammalian ribosome-trans- 
locon-OST supercomplex visualized by electron cryo-tomog- 
raphy (14, 15). We docked our high-resolution OST structure 
into this tomography map and found a remarkably good fit 
with the portion of the density previously assigned to OST 
(14, 15) (Fig. 5A). This provides additional evidence that yeast 
and mammalian OST complexes share a common architec- 
ture. Additional density in the tomography map near the N 
terminus of SWP1 (Fig. 5A) probably corresponds to an N- 
terminal extension of ~300 residues of the mammalian SWP1 
homolog Ribophorin II compared to yeast SWP1 (17). 

Our docking analysis suggests that the active site of 
STT3A faces the heterotrimeric translocon complex (49) (Fig. 
5, A and B). The contact point between OST and the translocon
comprises the region corresponding to TM2-TM4 of the
OST3 subunit in yeast OST (Fig. 5B). However, OST3 is not 
present in the mammalian, translocon-associated OST (43). 
Instead, recent studies suggest that the OST-translocon inter- 
action is mediated by the DC2 protein (50). A sequence com- 
parison reveals that the 3 TM helices of DC2 share a striking 
similarity with TM2-TM4 of OST3 (fig. S8), lending further 
weight to the docking model shown in Fig. 5. Multi-subunit 
OSTs may thus be described as a modular assembly of an OST 
core complex formed by seven subunits (STT3, OST1, OST2, 
OST4, OST5, WBP1 and SWP1), which associates either with 
oxidoreductases (OST3/6 or Tusc3/MagT1) or with the trans- 
locon. 

The arrangement shown in Fig. 5C allows unhindered ac- 
cess of the polypeptide substrate from the luminal exit of the 
translocon to the catalytic center of OST. The observed
separation of ~40 Å is in agreement with earlier distance estimates
based on the minimal polypeptide length of 65 residues
between the peptidyltransferase center of the ribosome
and the first possible glycosylation site (51). It is also in line
with the previously reported minimal distance of 30-40 Å between
the OST active site and the ER membrane deduced
from evaluating the glycosylation of integral membrane pro- 
teins (52). 


Conclusions: Functions of OST subunits 
Organisms encoding multi-subunit OSTs are found to 
glycosylate a significantly expanded range of protein substrates
compared to single-subunit OST enzymes (53). This
implies that the auxiliary subunits increase the efficiency of 
the catalytic STT3 core by contributing to substrate acquisi- 
tion or by affecting the folding of acceptor proteins. Our yeast 
OST structure suggests that OST2, OST4 and OST5, which 
contain mostly TM helices, have a scaffolding function and 
are thus important for complex stability, but without directly 
contacting the substrates (1, 54). The luminal domains of 
OST1, SWP1, and WBP1 may also have structural roles by stabilizing
the STT3 subunit conformation. In addition, some of
them may directly interact with the substrates. Our structure
revealed a cavity ranging from the active site of STT3 to the
WBP1 and SWP1 subunits, just above the membrane boundary
(Fig. 5C). The cavity is lined by the highly ordered N-glycan
attached to N539 of STT3 and is sufficiently large to
accommodate the glycan moiety of bound LLO substrate. It
is conceivable that WBP1, SWP1, and possibly even the ordered
N-glycan contribute to the recognition of the lipid-linked
Glc3Man9GlcNAc2 moiety and thus help define the
preference for an LLO substrate containing terminal, α-1,2-linked
glucose (55). For OST1, it is worth speculating that
given its proximity to the peptide-binding pocket of STT3, it 
might interact with acceptor proteins and influence their 
folding. The active site of the OST complex faces the peptide- 
binding OST3/6 subunits or the translocon, both of which 
present polypeptide substrates in an unfolded state. This ar- 
rangement favors N-glycosylation over the competing folding 
reactions and thus extends the substrate range of OST. 
Fig. 1. Purification, stoichiometry, and functional characterization of yeast OST. (A) Silver-stained SDS-PAGE
analysis of detergent-purified OST complexes containing either OST6 or OST3. A 1D4 affinity tag was fused to the C
terminus of OST4. Multiple bands for OST1 and WBP1 indicate heterogeneous N-glycosylation. (B) Mass
spectrometry-based quantification of subunit abundance in purified OST samples containing either OST3 (green
bars) or OST6 (blue bars) relative to OST complexes from wild type cell extract (n = 3, error bars indicate SD). (C)
In vitro glycosylation of a fluorescently labeled peptide (TAMRA-DANYTK) by nanodisc reconstituted, OST3-containing
yeast OST complex, using a synthetic C20-LLO (NerylCitronellyl-PP-GlcNAc2) as a donor substrate.
Glycosylated and non-glycosylated peptides were separated using a Tricine gel. (D) Following quantification of band
intensities in (C), the ratio of glycosylated to unreacted peptide was plotted against the reaction time and fitted using
a Michaelis-Menten saturation curve (n = 3, error bars indicate SD).
Fig. 2. Structure of yeast octa-subunit OST. (A) EM map of OST complex with density covering individual subunits
colored as in (D). (B) Cartoon representation of the yeast OST structure. Ordered glycans are shown in stick 
representation. (C) Cytosolic view onto the TM region of OST. (D) Structures of the single subunits in cartoon 
representation. EL5 and TM9 in STT3 as well as TM1 of OST3 are shown schematically. The crystal structure of the 
luminal domain of the homologous OST6 subunit (PDB-ID:3G9B) was used to illustrate the OST3 luminal domain 
(grey ribbon in dashed red box). The names of the corresponding human OST subunits are indicated in parentheses. 
Fig. 3. Sequence conservation in eukaryotic OST complexes. (A) Surface representation of yeast OST complex
colored according to sequence conservation. The predicted peptide and LLO binding sites are indicated. (B) 
Conserved subunit interfaces between OST1 and STT3 (left) and between WBP1 and SWP1 (right) are marked by 
dotted circles and arrows. 
Fig. 4. Active center of STT3. (A) Ribbon representation of peptide-bound PglB (PDB-ID: 5OGL) and yeast STT3.
Residues involved in substrate recognition or metal ion binding are shown as sticks. The numbering in the bound
peptide (DQNATF sequence) is relative to the acceptor Asn residue. (B) Superposition of yeast STT3 (green ribbon)
and LLO-bound PglB (grey ribbon). Functionally important residues (pink for PglB residues, blue for yeast STT3
residues) are shown as sticks, with interactions observed in PglB indicated with dashed lines. (C) Electrostatic surface
representations of PglB and yeast STT3. The peptide bound to PglB was modeled into the yeast STT3 structure using
the WWD motif and residues D56/D47 (PglB/STT3) as anchors. EL5 (aa294-aa322) of PglB was removed for clarity.
Corresponding regions revealing the structural basis of distinct peptide specificity are indicated with dashed ovals or 
boxes. (D) In vivo activity assay of STT3 point mutants. 
Fig. 5. OST-translocon interactions. (A) Yeast OST structure docked into electron cryo-tomography map of the
mammalian ribosome-translocon-OST complex (EMDB-ID: 3068). A close-up of the dashed region is shown in the 
left panel. Additional EM density above SWP1 and WBP1 (dashed oval) probably corresponds to a ~300 aa N- 
terminal extension present in mammalian SWP1 homolog Ribophorin II. (B) View from the ER lumen onto the 
transmembrane regions of OST and of the Sec61 translocon (PDB-ID:5A6U) (14) following the docking shown in 
(A). Although the OST3 subunit provides all contacts to Sec61 in this docking, it is replaced by the DC2 subunit in
translocon-associated OST complexes. (C) Model of translocon-associated OST complex architecture and 
function. The orange, curved line depicts a nascent polypeptide entering the ER through the translocon and 
binding to the active site of STT3. An LLO molecule was manually placed in its likely binding pocket, with a red line 
representing dolichol, circled “P” phosphate moieties, and the Glc3Man9GlcNAc2 moiety depicted by blue and
green symbols according to standard glycan nomenclature. The black arrow depicts the proposed direction of the 
nucleophilic attack during glycan transfer. 
Somatic mosaicism in the human brain may alter function of individual neurons. We analyzed genomes
of single cells from the forebrains of three human fetuses (15 to 21 weeks post-conception) using clonal 
cell populations. We detected 200-400 single nucleotide variations (SNVs) per cell. SNV patterns 
resembled those found in cancer cell genomes, indicating a role of background mutagenesis in cancer. 
SNVs with a frequency of >2% in brain were shared with the spleen, revealing a pregastrulation origin. 
We reconstructed cell lineages for the first five post-zygotic cleavages and calculated a mutation rate of 
~1.3 per division per cell. Later in development, during neurogenesis, the mutation spectrum shifted 
toward oxidative damage and the mutation rate increased. Both neurogenesis and early embryogenesis
exhibit drastically more mutagenesis than adulthood.


Somatic mutagenesis is one of the emerging areas of verte- 
brate genome biology. Several studies revealed extensive ge- 
nomic mosaicism marked by hundreds of single nucleotide 
variants (SNVs) per cell in somatic tissues of the human body 
such as skin fibroblasts, intestine, liver, and colon (1-3). Mo- 
saic copy number alterations are also common, and inser- 
tions of retrotransposable elements have been detected (4- 
10). Mosaicism is prominent in the central nervous system, 
with implications for brain evolution and the genomic underpinnings
of human neuropsychiatric disorders (11, 12).
Roughly 1500 SNVs might be present in mature neurons from 
the adult human cortex, only detectable in the analyzed cell 
and thought to be related to transcriptional activity (13).
However, the temporal origin of these SNVs during develop- 
ment is unknown. Furthermore, the use of in vitro whole ge- 
nome amplification (WGA) from single nuclei is prone to 
experimental artifacts mimicking SNVs (14, 15). Here we describe
the discovery and analysis of mosaic SNVs in neuronal
progenitor cells in three fetal human brains. Individual progenitor
cells were allowed to proliferate into clonal cell populations,
which provided insights into the genomes of the
founder cells (fig. S1) and an estimate of the frequency
and mutation spectrum of mosaic mutations in human
development while avoiding WGA-associated artifacts.


Discovery and validation of mosaic SNVs 


Brains were collected from phenotypically normal postmortem
human fetuses ranging in age from 15 to 21 weeks post-conception.
Based on comparing counts of germline SNVs
(3809591 for subject 316, 4316547 for subject 320, and
3746847 for subject 275) to those derived by the 1000
Genomes Project across different human sub-populations, we
concluded that subjects 316 and 275 were of non-African 
origin, whereas subject 320 (male, 17 pcw) was of African de- 
scent. 

From a bulk culture of dissociated cells of the VZ/SVZ of 
the frontal cortex (FR), parietal cortex (PA) or basal ganglia 
(BG), we grew 31 single cell derived clonal cell populations 
each containing a few thousand cells, using the limiting dilution
approach. The few divisions possible prior to dilution are
unlikely to contribute significantly to the mutation landscape in
each cell. DNA extracted from the individual clones, the
source tissue of germinal zones, and the spleen was se- 
quenced to a minimum of 30X genome coverage (fig. S1). For 
3 clones we could not derive enough cells, hence DNA was 
amplified by multi-displacement amplification (MDA) before 
sequencing. 

Mosaic SNVs present in the founder cell of the clones were 
discovered by comparing genomes of clones to each other and 
to genomes of the germinal zone tissue and spleen (table S1). 
We selected those calls with over 35% variant allele frequency 
(VAF) in clones as candidate mosaic SNVs. This limit 
was chosen to exclude mutations arising during culture, 
which should have a VAF of 25% or less. The distribution 
for the SNV discovery dataset is centered at around a VAF 
of 50% based on read depth, as expected for true mosaic 
variants (figs. S1 and S2).
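
The VAF cutoff described above can be sketched in a few lines of Python. A heterozygous variant present in the founder cell is carried by every cell of the clone and is expected at a VAF near 50%, whereas a variant arising at the first division in culture is carried by at most half of the cells and is expected at 25% or less. The `Call` record, the site labels, and the read counts below are hypothetical and serve only to illustrate the filter; they are not data from the study.

```python
from dataclasses import dataclass


@dataclass
class Call:
    site: str       # hypothetical locus label
    ref_reads: int  # reads supporting the reference allele
    alt_reads: int  # reads supporting the variant allele


def vaf(call):
    """Variant allele frequency: variant reads divided by total reads."""
    depth = call.ref_reads + call.alt_reads
    return call.alt_reads / depth if depth else 0.0


def candidate_mosaic(calls, min_vaf=0.35):
    """Keep calls whose VAF exceeds the cutoff (35% in the text)."""
    return [c for c in calls if vaf(c) > min_vaf]


# Illustrative calls at roughly 30X depth (made-up numbers).
calls = [
    Call("chr1:1000", ref_reads=16, alt_reads=14),  # VAF ~0.47, kept
    Call("chr2:2000", ref_reads=24, alt_reads=6),   # VAF 0.20, dropped
    Call("chr3:3000", ref_reads=15, alt_reads=15),  # VAF 0.50, kept
]
kept = candidate_mosaic(calls)
print([c.site for c in kept])
```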

When comparing clones to each other, we filtered the re- 
sulting calls based on conformity of their recurrence to clones 
that are expected and are not expected to carry the same mo- 
saic SNVs (fig. S3). Calls from such clone-to-clone compari- 
sons were 98.9% concordant with calls from comparing 
clones to VZ/SVZ brain tissue or spleen (Fig. 1A). However, 
among 68 calls made exclusively from clone-to-clone compar- 
ison, 31 (46%) were missed from clone-to-brain tissue or 
clone-to-spleen comparisons because they corresponded to 
SNVs present in tissues at high frequency (Fig. 1C), demon- 
strating the advantage of the clone-to-clone comparison ap- 
proach. Therefore, the clone-to-clone comparison represents 
an alternative design for the study of mosaicism to the use of 
familial trios (1). 

Eight randomly selected SNVs were all confirmed in the 
clones using PCR and Sanger sequencing (table S2). As an ad- 
ditional validation strategy, we designed an oligonucleotide 
library complementary to the loci of all 6288 SNVs compris- 
ing the discovery dataset and performed capture and deep re- 
sequencing (to ~1000X coverage) in the DNA from 10 clones. 
This confirmed the 50%-centered VAF distribution for a ma- 
jority of SNVs, with a minority (5.1%) having VAF lower than 
35%, perhaps indicating that these could have arisen during
cell culturing (figs. S1 and S4). Accordingly, we estimated our
false positive rate at around 5%. From an in-silico compari- 
son of our clones with the unrelated and well characterized 
cell line NA12878 we estimated that the sensitivity for discov- 
ering mosaic SNVs in the clones was ~83% (fig. S5). 
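
One simple way to combine these two error estimates when correcting a raw call count is sketched below in Python. The formula is an assumption for illustration and not necessarily the adjustment used in the study: it treats the false positive rate as the fraction of calls that are false, removes that fraction, and then scales up by the discovery sensitivity. The function name and the example input are hypothetical.

```python
def adjust_count(raw_calls, false_positive_fraction=0.05, sensitivity=0.83):
    """Estimate the true number of variants from a raw call count.

    Assumed model: true_variants * sensitivity = raw_calls * (1 - false_positive_fraction)
    """
    true_positive_calls = raw_calls * (1.0 - false_positive_fraction)
    return true_positive_calls / sensitivity


# Illustrative input: 100 raw calls (made-up number).
adjusted = adjust_count(100)
print(f"{adjusted:.1f} estimated true variants")
```

Under this model the two corrections partly offset each other, so the adjusted count is modestly higher than the raw count.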


Mosaic SNV counts, mutation spectra, and distribu- 
tions across brain regions 

SNVs were found at rates of 108-572 per clone, with clones 
from older brains containing more variants (Fig. 1D), which 
averages to 200-400 SNVs after adjustment for discovery sen- 
sitivity and false positives. No differences in SNV counts for 
clones from frontal and parietal cortex and from frontal cor- 
tex and basal ganglia of the same brain were noticeable (Fig. 
1D). Similarly, the relative contributions of substitution types 
to the mutation spectrum were the same for clones from dif- 
ferent brains and from different brain regions (Fig. 1E). Over- 
all, the transition to transversion ratio Ti/Tv was 0.6, with the 
most frequent substitution type being C:G>A:T transversion. 
This perhaps reflects DNA damage by oxidation, resulting in 
8-oxoguanine that is later fixed as thymine through incorrect
base pairing with adenine (16). The second most common
substitution was C:G>T:A transition, which is thought
to be caused by deamination of cytosine and 5-methylcyto- 
sine (16). Linear approximation of the increase in SNV counts 
over time allowed estimating a mutation rate of 5.1 (95% CI 
1.5 to 9) SNVs per day per progenitor during neurogenesis. 
This projects to a rate of roughly 8.6 (95% CI 1.6 to 20) SNVs 
per division per progenitor, assuming that the length of the cell
cycle of cortical progenitors is between 27 and 54 hours based
upon studies in primates (17). The large interval of this esti- 
mation is due to uncertainties in both the per day increase in 
SNVs and in the length of the cell cycle. Inter-individual var- 
iability may further widen the confidence interval of this es- 
timate. 
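
The two conversions used in this paragraph can be sketched as follows. This is a minimal illustration, not the authors' procedure: it assumes that the per-division rate is the per-day rate multiplied by the cell-cycle length in days, and that raw counts are adjusted by the factor (1 - false positive rate) / sensitivity. The function names are hypothetical.

```python
def per_division_rate(rate_per_day, cycle_hours):
    """SNVs per division, assuming one division per cell cycle (assumption)."""
    return rate_per_day * cycle_hours / 24.0


def adjusted_count(raw_count, sensitivity=0.83, false_positive_rate=0.05):
    """Scale a raw per-clone count for missed calls and false positives (assumed formula)."""
    return raw_count * (1.0 - false_positive_rate) / sensitivity


# Point estimate of 5.1 SNVs per day with a 27 to 54 hour cell cycle.
low = per_division_rate(5.1, 27)
high = per_division_rate(5.1, 54)
midpoint = (low + high) / 2.0
print(f"per-division rate: {low:.1f} to {high:.1f}, midpoint {midpoint:.1f}")

# Raw counts of 108 and 572 SNVs per clone after the assumed adjustment.
print(f"adjusted counts: {adjusted_count(108):.0f} to {adjusted_count(572):.0f}")
```

Under these assumptions the midpoint of the two bounds lands near the 8.6 SNVs per division quoted above.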

To genotype the presence of variants across brains we 
used the above referenced capture library to conduct deep se- 
quencing (~1000X coverage) in the source tissue (dorsal and 
basal germinal layers), in the corresponding outer layers con- 
taining mature neurons (frontal, parietal cortex and basal 
ganglia, respectively), as well as in other brain regions (occip- 
ital cortex, cerebellum) and in a peripheral tissue, the spleen 
(table S3). A total of 144 SNVs was reliably genotyped, from 
11 to 68 in each tissue, including spleen, with VAFs between 
0.3% and 30% (Fig. 2A and fig. S6). High VAFs for 7 such 
SNVs were further cross-confirmed with an orthogonal tech- 
nique, droplet digital PCR (table S2 and figs. S7 to S17). How- 
ever, for hundreds of SNVs at much lower VAF (typically 
below 1%) the evidence for their presence in each tissue was 
not significant, likely because of their low VAFs in tissues. 

Almost 60% of the genotyped SNVs (1.4% of total SNVs) 
and 92% of the SNVs with VAF above 2% in at least one brain 
region had nonzero VAF in the spleen (Fig. 2, B and C, and 
figs. S6 and S18). Since the brain is of neuroectodermal origin 
and the spleen of mesodermal origin, these shared SNVs 
likely occurred prior to gastrulation, when the mesoderm, ec- 
toderm and endoderm differentiate from a single-layered ep- 
ithelium. This suggestion is consistent with the range of VAFs 
of these shared SNVs, as there are about 12 cell divisions prior 
to gastrulation (18), corresponding to expected VAFs from 
0.03% to 25% in somatic tissues, depending on how early in 
embryogenesis variants have occurred. Some SNVs were 
shared between the spleen and only some of the brain regions 
(Fig. 2, B and C, and figs. S6 and S18). This could indicate 
regional sub-lineages, i.e., that non-mixing sets of progenitors 
generate neurons in different brain regions, and that these 
distinct populations of progenitors may not have a common 
ancestor with cells in spleen. But conducting more sensitive 
assays is necessary to exclude the possibility of incomplete 
genotyping. Another observation pointing to an early origin 
of the mutations is that the overlap between SNVs in the ba- 
sal ganglia VZ/SVZ and cortical VZ/SVZ with the respective 
differentiated regions was less than the overlap between the 
four regions together (fig. S18), suggesting that most of the 
genotyped mutations found in the brain arise at least before 
the splitting between basal and dorsal regions of the telen- 
cephalon, i.e., at around neural plate stage or earlier. 
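
The link between the timing of an early mutation and its expected VAF can be sketched as below. This is an illustration under simplifying assumptions that are not spelled out in the text: a heterozygous mutation arises in a single cell, 2^d cells exist after division d, and all cells contribute equally to every tissue. The function name is hypothetical.

```python
def expected_vaf(division):
    """Expected VAF of a heterozygous SNV arising in one daughter cell of a division.

    Assumes 2**division cells exist after that division and that every cell
    contributes equally to the tissue (both are simplifying assumptions).
    """
    if division < 1:
        raise ValueError("division must be 1 or greater")
    fraction_of_cells = 1.0 / (2 ** division)
    return 0.5 * fraction_of_cells  # one mutated allele out of two per carrier cell


for d in (1, 2, 3, 4, 5, 12):
    print(f"division {d:2d}: expected VAF {100 * expected_vaf(d):.3f}%")
```

The first division gives 25%, the upper end of the range quoted above. Values for late divisions depend on how divisions are counted, so the lower end of the range need not be reproduced exactly by this sketch.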

Genotyped mosaic SNVs clearly cluster by similar VAFs in 
each brain (Fig. 3 and fig. S19). Based on the average values 
of the frequencies for each cluster and sharing of such SNVs 
between clones, we concluded that these clusters likely rep-
resent variants created during sequential post-zygotic cleav- 
ages. Assuming equal contribution of dividing cells to tissues, 
we reconstructed the cell progeny tree and determined the 
precise origin of 84 mutations during the first 5 post-zygotic 
cleavages (Fig. 3B and fig. S19). These mutations typically had 
VAF above 1%, while the remaining ones, typically with VAF 
below 1%, were assigned to later divisions. Only two SNVs 
had conflicting assignment between clusters and clones, 
which perhaps could be explained by mis-clustering or incor- 
rect discovery/genotyping in clones. Alternatively, these con- 
flicts along with the high VAFs for the very first SNV in the 
tree (Fig. 3A) may indicate an unequal lineage contribution 
to tissues due to asymmetric division, unequal proliferation, 
or positive/negative selection (19). 

Using the trees we estimated the average mutation rates 
per division per daughter cell in the early human embryo as 
1.66 ± 0.24, 1.18 ± 0.33, and 1.05 ± 0.22 for brains of the sub-
jects 316, 320, and 275, respectively. The weighted average and
variance of the 3 measurements is 1.3 ± 0.15, consistent with
the rate of 1.2 estimated from analysis of de novo SNVs in 
familial trios (20), and lower than the lower bound estimate 
for mutability of neuronal progenitors, thereby suggesting 
that the mutation rate during neurogenesis is higher than 
that in the early embryo. 
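
One way to combine the three per-subject estimates is an inverse-variance weighted mean. The text does not state which weighting was used, so this is an assumption; the sketch below only shows that this standard choice gives a combined value close to the one reported. The function name is hypothetical.

```python
import math


def inverse_variance_mean(values, errors):
    """Return the inverse-variance weighted mean and its standard error."""
    weights = [1.0 / (e * e) for e in errors]
    total = sum(weights)
    mean = sum(w * v for w, v in zip(weights, values)) / total
    return mean, math.sqrt(1.0 / total)


# Per-division rates and uncertainties for subjects 316, 320, and 275.
rates = [1.66, 1.18, 1.05]
errors = [0.24, 0.33, 0.22]
mean, stderr = inverse_variance_mean(rates, errors)
print(f"weighted mean = {mean:.2f} +/- {stderr:.2f}")
```

With these inputs the result rounds to the combined rate of about 1.3 with an uncertainty of about 0.15.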

We then split the set of mosaic SNVs into those of early origin
(genotyped in tissues by capture experiments) and those of likely
late origin (not genotyped). The spectrum of early mo-
saic mutations (i.e., the frequency of nucleotide substitution 
in the context of tri-nucleotide motifs) bears little resem- 
blance to the spectrum of SNVs occurring later in neuronal 
progenitors, revealing a shift in mutagenesis during develop- 
ment (Fig. 3C). Early mutations had the same 2.2 Ti/Tv ratio 
as germline variants and had a larger fraction of C:G>T:A 
transitions overall (p-value of 2.2x10- by Fisher’s exact
test), and particularly in CpG motifs (p-value of 4.3x10~° by 
Fisher’s exact test), implicating the spontaneous deamination 
of 5-methylcytosine as a contributor to the mutagenic process 
(16). The signature of earlier mutations was also similar to 
the signature of de novo mutations in the human population 
(20). As some of the early mutations can be passed to the next
generation through the germline lineage, the convergence of 
their spectrum with de novo and germline variants is ex- 
pected. Later mutations, on the other hand, had a larger con- 
tribution of C:G>A:T transversions (p-value of 8.0x10°” by 
Fisher’s exact test), implicating oxidative damage as a signif-
icant contributor to the mutagenic process (16). Furthermore,
the mutation spectrum for these transversions was most similar
(correlation R=0.90) to the spectrum observed in MUTYH- 
defective colorectal cancer, where repair of 8-oxoguanine, the 
product of oxidative damage, is compromised (21).

Properties of mosaic SNVs

For the following analyses we used all mosaic SNVs in the 
discovery dataset from all clones, totaling to 6288, as muta- 
tion spectra for the three brains were extremely similar (fig. 
S20). The distribution of distances between neighboring mosaic SNVs was
consistent with uniform random placement across the ge- 
nome (fig. S21). In line with this, no enrichment of exonic and 
intronic SNVs in any gene ontology category was observed 
either when assuming uniform background mutation rate or 
using as a background mosaic SNVs from liver, intestine, co-
lon, or fibroblasts (1, 2). Roughly 3% of SNVs may have a func-
tional consequence either by affecting protein coding
sequence or gene regulation (fig. S21 and table S4). This pro- 
jects to about 12 non-benign SNVs per progenitor cell at 20 
post-conception weeks. A significant depletion of mosaic 
SNVs was observed in DNase hypersensitive sites relative to 
flanking regions (Fig. 4A). The depletion was more pro- 
nounced (10% vs 5%) when utilizing DNase hypersensitive 
sites for fetal brain rather than for a lymphoblastoid cell line, 
suggesting a relation between cells’ epigenome and the gene- 
sis of mutations. Since no such depletion was observed in 
coding relative to intronic gene regions, the depletion is not 
the result of negative selection but rather likely reflects better
repair efficiency in open DNA regions, as was observed for 
somatic mutations in cancers (22, 23). 

Similar to somatic SNVs in cancers (24) and mosaic SNVs 
in skin fibroblasts (1), we found that the density of mosaic
SNVs in the brain correlates negatively with most histone 
marks in fetal brain and embryonic stem cells (Fig. 4B). Com- 
parison of our mosaic SNVs with mutational signatures 
found in cancer (25, 26) revealed that signatures 18 and 8, 
found, respectively, in neuroblastoma and medulloblastoma, 
as well as their combination, are the best descriptors for the 
spectrum of mosaic mutations in the developing brain (Fig. 
4, C and D). Mosaic SNVs were equally well described by the 
combination of signatures 5 with 18 and 1B with 18. There-
fore, signature 18, with suspected etiology of oxidative dam-
age (27), consistently contributed to the mutation spectrum
of mosaic SNVs in fetal brain progenitors. This signature was
most similar to late SNVs, while signature 1B was most similar
to the early ones (fig. S22).


Implications for development and disease 

Our study uncovered extensive mosaicism in human fetal 
brain, with 200-400 SNVs present per brain progenitor cell 
at 15-21 weeks of gestation. This amount of mosaicism is 
likely inherited by cortical postmitotic neurons, as neurogen- 
esis ends around 20 weeks in humans (28). Indeed, our esti- 
mate is in good agreement with the estimate that postmitotic 
neurons have ~300-900 mosaic SNVs within one year of birth
(Lodato et al.). There is an order of magnitude difference
between numbers of mosaic SNVs and de novo SNPs (20) im-
plying a higher effect of mosaic SNVs on normal brain devel- 
opment and disease. Indeed we estimate that up to 12 non- 
benign mutations can be present in neuronal progenitors and 
consequently transmitted to a sizable fraction of daughter 
neurons. It is conceivable that in rare cases some of these 
mutations may have a strong deleterious effect, for example,
initiating overgrowth (29, 30) or neoplastic transformation by
knocking out key genes. Indeed, the resemblance of mosaic 
SNVs in fetal brain to somatic mutations in brain cancers and 
particularly to medulloblastoma supports the theory that 
cancer driving mutations can happen by chance during back-
ground mutagenesis (31).

As dozens of discovered mutations happen prior to gas- 
trulation, our study demonstrates that early post-zygotic mu- 
tations can be reconstructed from the analysis of a handful of 
clones and tissues, opening an avenue for charting individu- 
alized mosaicism maps. As mosaic variants can contribute to 
interindividual phenotypic differences and have been impli-
cated in an individual’s disease risk, we suggest that knowing
the individual “mosaicome” could be as important as know- 
ing the individual germline genome, particularly given the 
much stronger selection acting on germline variants and the 
lower penetrance of mosaic variants, which is likely to trans-
late into milder phenotypes.

We also discovered a shift in mutagenesis during develop- 
ment characterized by an increased mutation rate and a 
change in frequency of substitution types. We cannot rule out 
that the increased mutation rate can be partially explained 
by inter-individual variation, although we have no evidence 
for such variability. The shift occurs sometime between early 
cleavages and neurogenesis and may be the consequence of 
physiological, biochemical, and gene expression changes 
related to the generation of neurons from neural stem cells. 
Alternatively, the shift may reflect more general developmen- 
tal processes common to all tissues during organogenesis, 
and, based on increased counts of mutations related to oxi- 
dative damage, could be coupled to higher availability of rad- 
ical oxygen species after development of the cardio-vascular 
system of the embryo. If this is the case, we predict that mu- 
tation spectra and rates per division undergo a similar shift 
during development across all somatic lineages. 

Our estimated average mutation rate of 5.1 SNVs per day 
per neuronal progenitor during neurogenesis implies that 
neurons generated at early and later stages of neurogenesis 
will carry different burdens of mosaic variants. This rate is 
three orders of magnitude higher than 0.4 to 2 mutations per 
year accumulated in the germline lineage of adults (20, 32, 
33). It is also 50 times higher than the rate in postnatal stem 
cells of the small intestine, colon and liver, estimated to be 36 
mutations per year (2). Therefore, our results show that the 
prenatal period is intrinsically highly mutagenic, likely as the
consequence of oxidative damage coupled with more fre-
quent cell divisions.
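
The fold differences quoted in this paragraph can be checked with simple arithmetic. The sketch below annualizes the prenatal per-day rate purely for comparison, which is an assumption of this illustration (neurogenesis does not last a year); the variable names are hypothetical.

```python
DAYS_PER_YEAR = 365.25

prenatal_per_year = 5.1 * DAYS_PER_YEAR  # 5.1 SNVs per day per progenitor

# Comparison rates quoted in the text (mutations per year).
comparisons = {
    "germline lineage, lower bound": 0.4,
    "germline lineage, upper bound": 2.0,
    "postnatal stem cells (intestine, colon, liver)": 36.0,
}

fold = {name: prenatal_per_year / rate for name, rate in comparisons.items()}
for name, value in fold.items():
    print(f"{name}: prenatal rate is about {value:.0f}-fold higher")
```

The ratios come out at roughly a thousand-fold or more against the germline rates and roughly fifty-fold against postnatal stem cells, in line with the statements above.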

We found no difference in SNV count between progenitors 
from cortex and from basal ganglia, implying that mosaicism 
accumulates at similar rates across the brain during neuro- 
genesis. With the observed rate of 5.1 SNVs per day per neu- 
ronal progenitor one can project that cells in the forebrain 
SVZ and hippocampal subgranular zone, where neurogenesis 
and gliogenesis continue for more extended time periods
(34-36), would accumulate about one thousand mosaic mu- 
tations by the time of birth. This estimate is consistent with 
the estimates of about 1000 mosaic SNVs present in skin fi- 
broblast cells and in stem cells of colon and intestine in chil-
dren (1, 2); indeed, mutation rates in all somatic
proliferative cell lineages during prenatal development may 
be similar. 

Fig. 1. SNV discovery in brains. (A) Three approaches to discovering mosaic SNVs were contrasted:
comparing clones to the VZ/SVZ tissue of origin, comparing clones to the spleen, and comparing 
clones with each other (see fig. S3). The three approaches give largely concordant calls. The 
comparison is for calls from all three brains. (B) Calls unique to clone-to-original tissue (in blue) and 
clone-to-spleen (in red) discovery approaches are dramatically enriched for bases with less 
confident calling (as defined by the mask of the 1000 Genomes Project). These residual calls were 
not included in the final call set. (C) VAF of genotyped SNVs from deep re-sequencing in all three 
brains. The clone-to-clone discovery approach allows finding high frequency mosaic SNVs in brain 
tissue (green line) that are missed from clone-to-tissue/spleen comparisons. (D) Counts of mosaic 
SNVs per clone increase linearly with fetal age. (E) Contribution of each substitution type to the 
mutation spectrum is not different between different fetuses and brain regions. 

Fig. 2. Genotyping of SNVs in original tissues. (A) Several dozen mosaic SNVs with VAF of 0.3%
to 30% in tissues from various brain regions and from spleen are genotyped by the capture- 
resequencing approach (green line). For hundreds more SNVs, the evidence for presence in tissue 
is indistinguishable from background noise (blue line). (B) Venn diagram of genotyped mosaic SNVs 
across brain regions and spleen for subject 316. Almost 60% of mosaic SNVs could be genotyped in 
one or more brain regions and spleen and 44% could be genotyped in all brain regions and spleen. 
(C and D) Comparative VAFs for mosaic SNVs across different brain regions and spleen for the same 
subject. Many SNVs are shared by multiple brain regions and by brain and spleen with similar VAFs 
(shared across two tissues are in green, red and blue, while shared across three tissues are in 
magenta). 

Fig. 3. Reconstruction of mosaic SNV mutations during early development of subject 316.
(A) Hierarchical clustering of SNVs genotyped in the different brain regions and spleen by their VAFs 
revealed grouping consistent with SNVs sharing between clones (white squares represent zero VAF). 
Black and gray squares denote, respectively, SNVs discovered in clones and SNVs missed during 
discovery but genotyped afterwards. For completeness five SNVs (marked with *) were included in 
the analysis if present in multiple clones but the corresponding VAF estimation from capture-seq 
was not available. Their VAFs were estimated from whole genome tissue sequencing. Based on the 
corresponding average VAF (shown underneath each cluster), each cluster was assigned to 
consecutive post-zygotic divisions: D1 (no SNVs observed), D2, D3, D4, and D5. (B) The
reconstructed cell progeny tree during those divisions had only two conflicts of SNVs assignment, 
denoted by “?”, between clusters and clones. “Expected VAF” denotes VAF of mutations arising at 
each stage assuming equal contribution of all progenies to tissues. (C) Mutational spectra of likely 
early mosaic SNVs (solid colors), and presumably later arising SNVs (shaded colors), are different. 
The difference in the spectra is due to the shift in frequency of C:G>T:A transitions, particularly in
CpG motifs, and in C:G>A:T transversions. The spectrum of early SNVs is much closer to the 
spectrum for de novo SNVs in the human population (triangle with correlation R-values). Random 
distribution represents correlation coefficients when randomly but proportionally subsampling early 
and late mutations. 

Fig. 4. Properties of mosaic SNVs in brain. (A) Depletion of mosaic SNVs in DNase hypersensitive
sites, possibly indicating a better efficiency of DNA repair pathways in those regions (22, 23). 
(B) Density of mosaic SNVs correlates negatively with histone marks in embryonic stem cells and 
fetal brain, revealing similarity to somatic SNVs in cancers. (C) Mutational signatures 8 and 18 found 
in brain cancers have highest correlations with the mutation spectrum of mosaic SNVs. 
(D) Exhaustive combinations of pairs of signatures consistently show that signatures 1B and 5 also
contribute to description of the mutation spectrum in combination with signature 18. Thus, signature 
18 is the best descriptor of mosaic SNVs in developing brain. 

NEURODEVELOPMENT


Aging and neurodegeneration are 
associated with increased mutations 
in single human neurons 


Michael A. Lodato, Rachel E. Rodin, Craig L. Bohrson,
Michael E. Coulter, Alison R. Barton, Minseok Kwon, Maxwell A. Sherman,
Carl M. Vitzthum, Lovelace J. Luquette, Chandri N. Yandava, Pengwei Yang,
Thomas W. Chittenden, Nicole E. Hatem, Steven C. Ryu,
Mollie B. Woodworth, Peter J. Park, Christopher A. Walsh


It has long been hypothesized that aging and neurodegeneration are associated with
somatic mutation in neurons; however, methodological hurdles have prevented testing this
hypothesis directly. We used single-cell whole-genome sequencing to perform genome-wide
somatic single-nucleotide variant (sSNV) identification on DNA from 161 single neurons from
the prefrontal cortex and hippocampus of 15 normal individuals (aged 4 months to 82 years), as 
well as 9 individuals affected by early-onset neurodegeneration due to genetic disorders of DNA 
repair (Cockayne syndrome and xeroderma pigmentosum). sSNVs increased approximately 
linearly with age in both areas (with a higher rate in hippocampus) and were more abundant in 
neurodegenerative disease. The accumulation of somatic mutations with age—which we term 
genosenium—shows age-related, region-related, and disease-related molecular signatures and 
may be important in other human age-associated conditions. 


Aging in humans brings increased incidence
of nearly all diseases, including neuro-
degenerative diseases (1). Markers of DNA
damage increase in the brain with age (2),
and genetic progeroid diseases such as
Cockayne syndrome (CS) and xeroderma pigmento- 
sum (XP), both caused by defects in DNA damage 
repair (DDR), are associated with neurodegenera- 
tion and premature aging (3). Mouse models of 
aging, CS, and XP have shown inconsistent rela- 
tionships between these conditions and the ac- 
cumulation of permanent somatic mutations in 
brain and nonbrain tissue (4-7). Although analysis 
of human bulk brain DNA, composed of multiple 
proliferative and nonproliferative cell types, re- 
vealed an accumulation of mutations during aging 
in the human brain (8), it is not known whether 
permanent somatic mutations accumulate with 
age in mature neurons of the human brain. Here,
we quantitatively examined whether aging or dis- 
orders of defective DDR result in more somatic 
mutations in single postmitotic human neurons. 
Somatic mutations that occur in postmitotic 
neurons are specific to each cell and thus can 
only be comprehensively assayed by comparing 
the genomes of single cells (9). Therefore, we 
analyzed human neurons by single-cell whole- 
genome sequencing (WGS). Because alterations 
of the prefrontal cortex (PFC) have been linked to 
age-related cognitive decline and neurodegenerative 
disease (10), we analyzed 93 neurons from PFC of
15 neurologically normal individuals (Table 1 and 
tables S1 and S2) from ages 4 months to 82 years. 
We further examined 26 neurons from the hippo- 
campal dentate gyrus (DG) of 6 of these individuals 
because the DG is a focal point for other age- 
related degenerative conditions such as Alzheimer’s 
disease. Moreover, the DG is one of the few parts 
of the brain that appears to undergo neurogenesis 
after birth (11), which might create regional dif-
ferences in number and type of somatic mutations. 
Finally, to determine whether defective DDR in 
early-onset neurodegenerative diseases is asso- 
ciated with increased somatic mutations, we 
analyzed 42 PFC neurons from 9 individuals 
diagnosed with CS or XP (Table 1 and table S3). 
We isolated single neuronal nuclei using flow 
cytometry, lysed the nuclei on ice in alkaline 
conditions as previously performed (12, 13) (see 
the supplementary materials) to minimize lysis- 
induced artifacts, amplified their genomes using 
multiple displacement amplification (MDA), and 
subjected the amplified DNA to 45X WGS (Fig. 1A). 
To identify somatic single-nucleotide variants
(sSNVs) with high confidence, we developed a 
bioinformatic pipeline called linked-read analysis 
(LiRA) (14) to delineate true double-stranded
sSNVs from single-stranded variants and artifacts. 
This method employs read-based linkage of can- 
didate sSNVs with nearby germline SNPs and 
performs a model-based extrapolation of the 
genome-wide mutational frequency based on the 
~20% of sSNVs that are sufficiently close to germ- 
line SNPs (Fig. 1B and supplementary materials). 
sSNVs determined by our algorithm (tables S4 and 
S5) showed alternate allele frequency distribution 
strikingly matching that of the germline SNVs 
(fig. S1). sSNV counts were not systematically 
influenced by technical metrics, such as post- 
mortem interval, time in storage, and coverage 
uniformity (fig. S2). 
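
The idea of read-based linkage can be illustrated with a toy classifier. This is not the LiRA model: it is a simplified sketch in which each read spanning both sites is reduced to a pair (germline allele, whether the candidate base is present), and a candidate counts as perfectly linked only if every read on one germline haplotype carries it and no read on the other does. The function name and read representation are hypothetical.

```python
from collections import Counter


def classify_candidate(reads):
    """Classify a candidate sSNV from reads spanning it and a nearby germline SNP.

    Each read is a (germline_allele, has_candidate) pair, where germline_allele
    is "ref" or "alt" and has_candidate says whether the read carries the
    candidate base. Toy rule for illustration, not the LiRA model.
    """
    counts = Counter(reads)
    carriers = {allele for (allele, has_candidate) in counts if has_candidate}
    if not carriers:
        return "no support"
    if len(carriers) > 1:
        return "artifact-like: candidate seen on both haplotypes"
    haplotype = next(iter(carriers))
    if counts[(haplotype, False)] > 0:
        return "artifact-like: partially linked"
    return "perfectly linked"


# Locus 1: every read with the linked germline allele carries the candidate.
locus1 = [("alt", True)] * 6 + [("ref", False)] * 5
# Locus 2: only some reads on that haplotype carry the candidate.
locus2 = [("alt", True)] * 3 + [("alt", False)] * 4 + [("ref", False)] * 5
print(classify_candidate(locus1))
print(classify_candidate(locus2))
```

A real caller would also need to handle sequencing errors, coverage thresholds, and the genome-wide extrapolation from the linked subset, none of which this sketch attempts.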

Across all normal neurons, genome-wide sSNV 
counts correlated with age (Fig. 2A and fig. S3) 
(P = 2 x 10°°, mixed effects model) (see the 
supplementary materials), despite some within- 
individual and within-age group heterogeneity. 
To explore potential variation in different brain 
regions, matched DG and PFC neurons were se-
quenced in six cases (Fig. 2A and fig. S3). Our
analysis uncovered region-specific sSNV accu- 
mulation with age in both PFC (P = 4 x 10-°) and 
DG single neurons (P = 2 x 107), suggesting an 
almost twofold increase in the rate of accumu- 
lation in DG (~40 sSNVs/year) relative to PFC 
(~23 sSNVs/year) (P = 8 x 10“). Among the six cases, 
two had significant increases in DG, three had nomi- 
nally increased counts in DG compared to PFC, and 
one had a nominally higher count in PFC (Fig. 2B). 

Neurons from postmortem brains of CS in- 
dividuals showed a ~2.3-fold excess of sSNVs 
relative to the expected age-adjusted normal PFC 
rate, and XP neurons showed a ~2.5-fold increase 
(P = 0.006 for both) (Fig. 2C). Progeroid neurons 
showed a similar number of sSNVs as neurons 
from aged normal PFC, suggesting that defec- 
tive nucleotide excision repair accelerates aging 
through sSNV accumulation. 

Molecular patterns of sSNVs also evolved with 
age. We previously reported that cytosine de- 
amination influences patterns of human neuron 
sSNVs, resulting in abundant C>T mutations (13). 
C>T sSNVs accounted for most variants in the 
youngest PFC samples, but this fraction decreased 
with age (Fig. 2D and figs. S4 and S5). C>T muta- 
tions, although common in many biological con- 
texts (15-18), are also a known artifact of MDA 
(19). Systematic differences in C>T burden dur- 
ing aging suggest that C>T variants are largely 
biological and not technical in nature. T>C var-
iants increased in the PFC with age (Fig. 2E and
figs. S4 and S5), possibly representing DNA dam-
age linked to fatty-acid oxidation (20). As demon-
strated previously (13), neuronal sSNVs in normal
PFC were enriched in coding exons (fig. S6 and 
table S6) and displayed a transcriptional strand 
bias (fig. S7), and genes involved in neural function 
were enriched for neuronal sSNVs (fig. S8 and 
table S7). Coincident probability modeling sug- 
gested that the linear accumulation of sSNVs 
in our data set would result in an exponential 

Fig. 1. Detection of sSNVs across individuals
and brain regions using single-neuron WGS
and linkage-based analysis. (A) Experimental
outline. (B) Schematic of linkage-based muta-
tion calling. A somatic mutation (red) may occur
on one allele of a locus (Locus 1), potentially in
close proximity to a germline heterozygous site
(blue and orange), while other loci, such as
Locus 2, remain unmutated. Later amplification
errors could create a mismatch (yellow) on one
strand of one allele of Locus 2 near a germline
variant (purple). For Locus 1, any WGS read that
covers both sites and contains the germline
(orange) variant should also contain the
somatic (red) variant; thus, these variants are
perfectly linked. In contrast, at Locus 2,
some but not all reads that cover the relevant
germline variant (purple) will contain the
somatic “variant” candidate (yellow),
generating two classes of reads, some
with the somatic variant on that allele, some
without. Only perfectly linked sSNV candidates
were considered in this study.


Fig. 2. sSNVs increase with age in the
PFC and DG and are elevated in CS
and XP. (A) sSNV counts plotted
against age for neurons derived from
PFC (circles) and DG (triangles),
with linear regression lines. There
is a strong correlation with age
in both cases, with the rate of
accumulation being nearly twofold
higher in the DG than in the PFC.
(B) Comparison of sSNV counts in
matched PFC and DG within brains.
(C) CS and XP patient neurons
display elevated sSNV counts.
* denotes P < 0.05 for (B) and
(C). (D and E) Fraction of
sSNVs composed of C>T (D)
and T>C (E). *, **, and
*** denote P < 0.05, 0.001,
and 0.0001, respectively, using
two-way analysis of variance
with Sidak’s correction.
accumulation of biallelic deleterious coding muta-
tions, in agreement with classical hypotheses re- 
garding the relationship between mutation and 
aging (fig. S9) (21), exacerbating differences in sSNV
load in aging, across brain regions, and in disease. 


Mutational signature analysis (22) revealed 
three signatures driving single-neuron mutational 
spectra (Fig. 3, A and B, and figs. S10 and S11). 
Signature A was composed mainly of C>T and 
T>C mutations and was the only signature to in-
crease with age (P = 9 x 10°”), independent of
brain region or disease status (Fig. 3, C to E). Sig-
nature A resembled a “clocklike” signature found in 
nearly all samples in a large-scale cancer genome 
analysis (Signature 5) (22) (fig. S10). Our data show 

Fig. 3. Signature analysis reveals mutational processes during aging,
across brain regions, and in disease. (A) Three mutational signatures
identified by non-negative matrix factorization (each substitution is
classified by its trinucleotide context). (B) Number of variants from
Signatures A, B, and C in each of the 161 neurons in the data set.
(C to E) Signature A strongly correlates with age, regardless of disease status
or brain region, whereas Signatures B and C do not. (F) Signature B is
enriched in DG relative to PFC neurons. (G) Signature B increased with age in
DG neurons, but not in matched PFC neurons, revealing a DG-specific
aging signature. Solid shapes represent regional means, and transparent
shapes represent individual neurons. (H and I) Comparison of age-corrected
estimate of sSNVs per signature in CS and XP compared with PFC
controls revealed enrichment in Signature C in both CS and XP. * and
** denote P < 0.05 and P < 0.001, respectively; mixed linear model.

Lodato et al., Science 359, 555-559 (2018)    2 February 2018

that a similar clocklike signature is also active in
postmitotic cells and hence independent of DNA 
replication. 

Signature B consisted primarily of C>T muta- 
tions and did not correlate with age (Fig. 3D), sug- 
gesting a mutational mechanism active at very 
young ages, perhaps prenatally. Signature B may 
include technical artifacts, which are primarily C>T, 
but bona fide clonal sSNVs are also predominantly 
C>T (8, 12). This signature was enriched in DG 
compared with PFC (P = 2 x 10~*) (Fig. 3F) and 
increased with age in DG, but not in PFC (P = 0.04, 
difference in slopes) (Fig. 3G). The observable dif- 
ference in Signature B between these brain regions, 
and its correlation with age in DG alone, suggest 
that it is dominated by a biological mechanism, 
and these PFC-DG differences strikingly mirror 
differences in neurogenesis. 

A third signature, Signature C, was distin- 
guished from Signatures A and B by the presence
of C>A variants, the mutation class most closely
associated with oxidative DNA damage (20). In- 
deed, CS and XP neurons, defective in DDR, were 
enriched for Signature C (P = 0.016 and 0.023, re-
spectively) (Fig. 3, H and I), whereas Signature C
also increased modestly with age in normal 
neurons (P = 0.03). An outlier 5087 PFC neuron 
with the highest sSNV rate in our data set had a 
high proportion of Signature C mutations rela- 
tive to other normal neurons, highlighting that 
even within a normal brain some neurons may 
be subject to catastrophic oxidative damage. 

Table 1. Case information and number of neurons analyzed in this study. Columns: Case ID, Sex, Age (years), Diagnosis. Total: 161 neurons.

Our analysis revealed that sSNVs accumulated
slowly but inexorably with age in the normal hu-
man brain, a phenomenon we term genosenium,
and more rapidly still in progeroid neurodegenera-
tion. Within 1 year of birth, postmitotic neurons
already have ~300 to 900 sSNVs, strikingly dove-
tailing with the 200 to 400 sSNVs estimated to
be already present in human progenitor cells at
20 weeks of gestation or earlier (23). Three sig-
natures were associated with mutational processes 
in human neurons: a postmitotic, clocklike signa- 
ture of aging, a possibly developmental signature 
that varied across brain regions, and a disease- and 
age-specific signature of oxidation and defective 
DNA damage repair. These associations were pres- 
ent even when data were reanalyzed after removal 
of all C>T mutations (fig. S11), demonstrating that 
any artifactual C>T mutations that may have es- 
caped our filtering do not affect our main con- 
clusions. The increase of oxidative mutations in 
aging and in disease presents a potential target 
for therapeutic intervention. Further, elucidating 
the mechanistic basis of the clocklike accumula- 
tion of mutations across brain regions and other 
tissues would increase our knowledge of age- 
related disease and cognitive decline. CS and XP 
cause neurodegeneration associated with higher 
rates of sSNVs, and it will be important to define 
how other, more common causes of neurodegen- 
eration may influence genosenium as well. 
CARBON FIXATION 


A primordial and reversible 
TCA cycle in a facultatively 
chemolithoautotrophic thermophile 
Shigeru Shimamura, Katsunori Yanagawa, Aya Shuto, Naohiko Ohkouchi, 
Nobuyuki Fujita, Yoshihiro Takaki, Haruyuki Atomi, Ken Takai 


Inorganic carbon fixation is essential to sustain life on Earth, and the reductive tricarboxylic acid (rTCA) cycle is one of the most ancient carbon fixation metabolisms. A combination of genomic, enzymatic, and metabolomic analyses of a deeply branching chemolithotrophic Thermosulfidibacter takaii ABI70S6ᵀ revealed a previously unknown reversible TCA cycle whose direction was controlled by the available carbon source(s). Under a chemolithoautotrophic condition, a rTCA cycle occurred with the reverse reaction of citrate synthase (CS) and not with the adenosine 5′-triphosphate-dependent citrate cleavage reactions that had been regarded as essential for the conventional rTCA cycle. Phylometabolic evaluation suggests that the TCA cycle with reversible CS may represent an ancestral mode of the rTCA cycle and raises the possibility of a facultatively chemolithomixotrophic origin of life. 


Autotrophic carbon fixation is essential for sustaining life on Earth. The reductive tricarboxylic acid (rTCA) cycle and Wood-Ljungdahl pathway have been recognized as the most ancient carbon fixation pathways to ensure the biosynthesis of the five universal precursors of anabolism: acetyl-CoA (coenzyme A), pyruvate, phosphoenolpyruvate, oxaloacetate, and 2-oxoglutarate (1-5). However, essential components of the presently recognized rTCA cycle, the adenosine 5′-triphosphate (ATP)-dependent citrate cleavage enzymes ATP citrate lyase (ACL) or citryl-CoA synthetase (CCS)/citryl-CoA lyase (CCL), most likely emerged at a later stage from the domains of citrate synthase (CS) and succinyl-CoA synthetase (fig. S1) (6-9). 

The Aquificae strain Thermosulfidibacter takaii ABI70S6ᵀ has been recognized as a chemolithoautotroph (10, 11), but this study revealed its ability for chemolithomixotrophic growth with organic acids including acetate and succinate in the presence of yeast extract (Fig. 1 and supplementary text). Hydrogen oxidation coupled with sulfur reduction was essential for growth. The genome sequence of T. takaii ABI70S6ᵀ (~1.8 million base pairs) (supplementary text) was not sufficient to identify the carbon fixation pathway, although the strain was expected to harbor a rTCA cycle because of its close phylogenetic relationship with other deeply branching chemolithoautotrophic Aquificae bacteria (9-11). 

¹Research and Development Center for Marine Biosciences, Japan Agency for Marine-Earth Science and Technology (JAMSTEC), 2-15 Natsushima-cho, Yokosuka 237-0061, Japan. ²Department of Biogeochemistry, Japan Agency for Marine-Earth Science and Technology (JAMSTEC), 2-15 Natsushima-cho, Yokosuka 237-0061, Japan. ³Institute of Low Temperature Science, Hokkaido University, Kita-19, Nishi-8, Kita-ku, Sapporo 060-0819, Japan. ⁴Department of Synthetic Chemistry and Biological Chemistry, Graduate School of Engineering, Kyoto University, Katsura, Nishikyo-ku, Kyoto 615-8510, Japan. ⁵NITE Biological Resource Center (NBRC), National Institute of Technology and Evaluation (NITE), 2-49-10 Nishihara, Shibuya-ku, Tokyo 151-0066, Japan. ⁶Department of Subsurface Geobiological Analysis and Research, Japan Agency for Marine-Earth Science and Technology (JAMSTEC), 2-15 Natsushima-cho, Yokosuka 237-0061, Japan. ⁷Core Research for Evolutional Science and Technology (CREST), Japan Science and Technology Agency (JST), 7 Goban-cho, Chiyoda-ku, Tokyo 102-0076, Japan. 
*Corresponding author. Email: takuron@jamstec.go.jp †Present address: Faculty of Environmental Engineering, The University of Kitakyushu, Japan. ‡Present address: Faculty of Life Sciences, Tokyo University of Agriculture, Japan. 

Nunoura et al., Science 359, 559-563 (2018) 

On the T. takaii genome, genes related to the 
(r)TCA cycle are encoded in one gene cluster, and 
only the isocitrate dehydrogenase (ICDH) gene 
is found in a different locus (table S1). Surpris- 
ingly, however, we could not find genes for ACL 
and CCS/CCL. Genes with essential motifs or 
signatures of these enzymes were also absent. 
Genes for the other known microbial carbon 
fixation pathways could not be identified. As for 
potential carbon dioxide fixation enzymes, genes 
for 2-oxoglutarate:ferredoxin oxidoreductase 
(OGOR), ICDH, pyruvate:ferredoxin oxidoreductase 
(POR), pyruvate carboxylase, malate dehydrogenase 
(malic enzyme), phosphoenolpyruvate carboxy- 
kinase (PEPCK), acetyl-CoA carboxylase, and propionyl- 
CoA carboxylase were present (fig. S2 and table S1). 

The activities of these enzymes related to carbon fixation were examined with cell-free extract obtained from T. takaii cells grown chemolithomixotrophically. Among the potential carboxylases, activities of POR, OGOR, ICDH, PEPCK, acetyl-CoA carboxylase, and propionyl-CoA carboxylase were detected, whereas those of malic enzyme and pyruvate carboxylase were not. Most of the enzymatic activities in the rTCA cycle were detected (table S1). ATP-independent, CoA-dependent citrate lyase activity (reverse reaction of CS) was present, but ATP/GTP (guanosine 5′-triphosphate)-citrate lyase and CoA-independent citrate lyase activities were absent (Table 1). Both citrate-generating and citrate-cleaving CS activities were detected in cells grown chemolithoautotrophically. The levels of CS activity in both directions and the malate dehydrogenase (MDH) activity were comparable in T. takaii cells grown chemolithoautotrophically and chemolithomixotrophically, suggesting that the two enzymes are expressed constitutively. Intriguingly, the ATP-independent turnover rate of citrate to oxaloacetate in T. takaii was comparable to the ATP-dependent conversion rate observed in other deeply branching chemolithoautotrophic Aquificae bacteria that utilize ACL or CCS/CCL in the classical rTCA cycle (11) (Table 1). To address the possibility that CS was involved in citrate cleavage in T. takaii, we examined the levels of CS activity in other deeply branching bacteria and compared them to that in T. takaii. The activity in T. takaii was at least two orders of magnitude higher than those observed in Hydrogenobacter thermophilus and Persephonella marina, which utilize the rTCA cycle for growth. None of the three citrate cleavage activities was observed in Thermodesulfatator indicus cells, consistent with the absence of genes for CS, ACL, and CCS/CCL on its genome. These results suggest that there are no enzymes other than CS, ACL, and CCS/CCL that are involved in citrate cleavage in the deeply branching chemolithoautotrophic bacteria. 

The CS activity in T. takaii was the highest 
among the enzyme activities tested. A high CS 
activity is unlikely to co-occur with carbon fixa- 
tion pathways using acetyl-CoA as an interme- 
diate such as the archaeal 3-hydroxypropionate/ 
4-hydroxybutyrate (3HP/4HB) and dicarboxylate/ 
4-hydroxybutyrate (DC/HB) cycles, as it would 
compete in consuming acetyl-CoA (12). In Thermoproteus neutrophilus, which utilizes the DC/HB cycle for CO₂ fixation and harbors a TCA cycle, 
the specific activity of CS was only 2% of that of 
MDH (13). Enzyme activity measurements in 
T. takaii also suggested the absence of these 
archaeal pathways (table S1). 

RNA sequencing of T. takaii cells indicated that all of the genes involved in the TCA cycle were transcribed under both chemolithoautotrophic and chemolithomixotrophic (with acetate and/or succinate) growth conditions (table S2). Shotgun proteome analysis of the cell-free extract also confirmed the expression of most of these genes in cells grown chemolithoautotrophically and chemolithomixotrophically with acetate or succinate (table S2). 

The carbon fixation pathway of T. takaii was further characterized by isotopologue analysis of protein-derived amino acids in cells grown with ¹³C-labeled substrates (fig. S3). Genomic information suggests that alanine (Ala), aspartate (Asp), and glutamate (Glu) are synthesized from pyruvate, oxaloacetate, and 2-oxoglutarate by alanine aminotransferase, aspartate aminotransferase, and glutamate dehydrogenase, respectively (fig. S4). The isotopologues of amino acids from cells grown chemolithoautotrophically in the presence of [¹³C]bicarbonate for 3 hours can be explained by a citrate cleavage reaction and production of [1-¹³C]acetyl-CoA through the rTCA cycle (fig. S5 and tables S3 and S4). Intriguingly, although anaplerosis is necessary to sustain the level of cycle intermediates and maintain chemolithoautotrophy, the contribution of anaplerotic pathways was not clearly observed. Further studies will be necessary to clarify this issue. Shotgun proteome analysis of the cell-free extract of cells grown chemolithoautotrophically confirmed the presence of pyruvate carboxylase, PEPCK, and malic enzyme (fig. S2), indicating the potential for anaplerosis with these enzymes. 

The isotopologues from cells grown chemolithomixotrophically in the presence of succinate were examined with either [2,3-¹³C]succinate or [¹³C]bicarbonate. The isotopologue pattern can be explained by a bifurcated TCA cycle (fig. S5 and tables S3 and S4). The isotopologue pattern of Glu indicated the reductive generation of 2-oxoglutarate from succinate via succinyl-CoA. In contrast, that of Ala suggested that pyruvate was formed via decarboxylation of malate or oxaloacetate and not via conversion of acetyl-CoA to pyruvate. Moreover, that of Asp also indicated the dominance of oxidative generation of oxaloacetate from succinate, and a portion was supplied by reduction from succinate via succinyl-CoA, 2-oxoglutarate, isocitrate, and citrate. Therefore, in this case, succinate proceeds in both the oxidative and reductive directions of the cycle, converging at acetyl-CoA via oxaloacetate and pyruvate in the former case and via 2-oxoglutarate, citrate, and citrate cleavage in the latter. 

Fig. 1. Growth curves of T. takaii grown under chemolithoautotrophic and chemolithomixotrophic conditions. A gas mixture of H₂ and CO₂ (80:20) was used for all cultivation experiments in this study. 

The isotopologue patterns in cells grown with acetate suggested a bifurcated TCA pathway from acetyl-CoA (fig. S5 and tables S3 and S4). Growth with acetate was examined in the presence of [1-¹³C]acetate, [1,2-¹³C]acetate, or [¹³C]bicarbonate. The isotopologue pattern of Ala suggested that pyruvate was generated from acetyl-CoA (carboxylation), whereas that of Glu suggested the generation of 2-oxoglutarate via oxidation (decarboxylation) of citrate, and not via reduction (carboxylation) of oxaloacetate. The presence of [5-¹³C] or [4,5-¹³C₂]Glu in the cells grown with [1-¹³C] or [1,2-¹³C₂]acetate, respectively, implied that a portion of cellular oxaloacetate was supplied from the yeast extract component in the medium. Examination of Asp and Ala indicated that pyruvate was generated from acetyl-CoA derived from acetate and oxaloacetate was formed via carboxylation of PEP or pyruvate. Therefore, in this case, acetate acts as the precursor of acetyl-CoA. 

Intriguingly, in the presence of both succinate 
and acetate with [¹³C]bicarbonate, signatures of 
inorganic carbon fixation were not observed in 
the isotopologues of amino acids (fig. S5 and 


Table 1. Enzyme activities of CS, ATP citrate lyase (ACL), citryl-CoA synthetase/citryl CoA lyase (CCS/CCL), and malate dehydrogenase (MDH) in 
T. takaii cells grown chemolithoautotrophically and/or chemolithomixotrophically along with those of other hydrogenotrophic thermophilic 
bacteria belonging to deeply branching lineages; Thermovibrio, Hydrogenobacter, and Persephonella in Aquificae, and Thermodesulfatator in 
Thermodesulfobacteria. All activities were measured at 70°C. A hyphen (-) indicates not detected; N.T., not tested. The parentheses in ACL and CCS/CCL 
activity measurements indicate that ATP-dependent activity is negligible compared with ATP-independent CS activity. 


Enzymatic activities (nmol product min⁻¹ mg⁻¹ protein) 


Species (growth conditions): Thermosulfidibacter takaii (chemolithomixotrophic); Thermosulfidibacter takaii (chemolithoautotrophic); Thermovibrio ammonificans (chemolithoautotrophic); Hydrogenobacter thermophilus (chemolithoautotrophic); Persephonella marina (chemolithoautotrophic); Thermodesulfatator indicus (chemolithomixotrophic) 
Enzymes 
Oxaloacetate 
Cs ____Production* 208 ee 20 : ; ; ‘ 


Oxaloacetate 


production* 


ACL and 
CCS/CEls 


PERMA_1714, 
PERMA_1715 


MDH 


(620) (71) 330 260 
: Theam_1021, . 
Theam_1022 
A ‘ irLO2Z01. 
HTH_1737 
- - HTH_0311 
11,500 10,300 5,100 5,500 
825 NIT. 2100 200 
TST_O07384 Theam_0602 HTH_O756 


*Oxaloacetate production was assessed by NADH consumption coupled with endogenous malate dehydrogenase. †The CS activity was interpreted as a reverse reaction of CCL (8). 




tables S3 and S4). This implied that the flux with 
both succinate and acetyl-CoA proceeded in the 
direction of decarboxylation. This is in contrast 
to the rTCA cycle with ACL from Chlorobaculum 
tepidum, which maintains its direction even in 
the presence of acetate (14). Inorganic carbon 
fixation examined with [¹³C]bicarbonate was also 
absent in cells grown chemolithomixotrophical- 
ly with yeast extract as the sole organic carbon 
source. The result suggests that the directions 
of the TCA cycle in the presence of succinate 
and/or acetate were not affected by the presence 
of yeast extract. 

A phylogenetic analysis of the CS domain 
indicated that the CS from T. takaii belonged 
to a cluster of CSs from deeply branching an- 
aerobic bacteria, Deltaproteobacteria, Bac- 
teroidetes, and eukaryotes (fig. S6). The CSs of 
Desulfurella acetivorans in Deltaproteobacteria, 
one of which functioned in the rTCA cycle (5), 
were also found in this cluster, whereas Delta- 
proteobacteria is distinct from Aquificae in 
the small subunit ribosomal RNA gene phylog- 
eny. The phylogenetic topology suggests that 
both ACL and CCL likely evolved from a lineage 
of CSs. Among the characterized CSs, the CS 
from T. takaii was most related to the enzymes 
from eukaryotes, and there were no marked dif- 
ferences that suggest a bidirectional role for 
the T. takaii enzyme (fig. S7). 

Fig. 2. SDS-polyacrylamide gel electrophoresis of recombinant citrate synthase (CS) (TST_0783) and malate dehydrogenase (MDH) (TST_0784) proteins from T. takaii individually expressed in E. coli. One microgram of protein was loaded onto each lane. 

We produced and purified recombinant CS and MDH from T. takaii (Fig. 2). In the case of CS, we measured activity in both the direction of citrate generation and cleavage. With CS alone, only very small amounts of oxaloacetate were formed, and it was difficult to measure the activity using MDH. With an increase in enzyme and longer reaction times, we clearly observed generation of acetyl-CoA by high-performance liquid chromatography (HPLC) analysis (fig. S8A), indicating the occurrence of the citrate-cleaving activity. When we directly coupled the CS activity with the MDH activity, a significant amount of ATP-independent citrate lyase activity was observed, which was dependent on the presence of CS, citrate, and CoA (Table 2 and fig. S9). Production of acetyl-CoA under these conditions was also confirmed by HPLC (fig. S8B). The Michaelis constant (Km) values for citrate (8.6 µM) and CoA (17.4 µM) were extremely low, even when compared to those for previously characterized ATP-citrate lyase enzymes (Table 3). We next examined the citrate-generating activity. Measurements were carried out by quantifying the generation of CoA with varying concentrations of acetyl-CoA or oxaloacetate in the presence of 3000 µM oxaloacetate or 1000 µM acetyl-CoA, respectively (Table 2 and fig. S9). During the analysis at 70°C, we observed thermal degradation of acetyl-CoA to CoA, which was problematic particularly in measurements at low concentrations of oxaloacetate and 1000 µM acetyl-CoA. We thus carried out measurements in the presence of 200 µM acetyl-CoA when varying the concentration of oxaloacetate. The kcat/Km values for the citrate-generating reaction were lower than expected, and this may be due to the thermal degradation of acetyl-CoA in our reactions. However, the high affinity and high kcat/Km value with citrate (and CoA) support our hypothesis that the CS from T. takaii functions in the direction of citrate cleavage in this organism. In the case of MDH, we could carry out kinetic analyses on the reaction from oxaloacetate to malate. The Km value for oxaloacetate was 20.1 ± 6.2 µM. The Vmax value was relatively high (1550 ± 155 µmol min⁻¹ mg⁻¹), and the kcat/Km value was calculated as 43,300 s⁻¹ mM⁻¹ (Table 3 and fig. S10). We cannot judge whether these values are high or not, as there are no MDH proteins that have been characterized from organisms relying on the rTCA cycle for growth. We further examined the overall equilibrium of the CS and MDH reactions converting citrate, CoA, and reduced nicotinamide adenine dinucleotide (NADH) to malate, acetyl-CoA, and NAD+ at 50°C and obtained an equilibrium constant of K = 0.046 ± 0.009 (fig. S11). Analyses of the two enzymes from T. takaii suggest that the enzymes can function in the direction of the rTCA cycle. 
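
The kinetic figures quoted for the recombinant enzymes can be cross-checked with a few lines of arithmetic. The sketch below is illustrative only and is not part of the authors' analysis. It recomputes kcat/Km for CS with citrate from the kcat and Km listed in Table 2, back-calculates the MDH kcat implied by the reported kcat/Km and Km, and converts the reported equilibrium constant into a standard free energy change using the textbook relation ΔG° = −RT ln K. The function names are hypothetical, and treating K as a dimensionless constant at 50°C is an assumption.

```python
import math

R_KJ_PER_MOL_K = 8.314e-3  # molar gas constant in kJ mol^-1 K^-1


def catalytic_efficiency(kcat_per_s: float, km_mM: float) -> float:
    """Return kcat/Km in s^-1 mM^-1."""
    return kcat_per_s / km_mM


def kcat_from_efficiency(efficiency_per_s_mM: float, km_mM: float) -> float:
    """Back-calculate kcat (s^-1) from kcat/Km and Km."""
    return efficiency_per_s_mM * km_mM


def standard_free_energy_kj(k_eq: float, temp_celsius: float) -> float:
    """Standard free energy change (kJ/mol) from dG = -RT ln K."""
    return -R_KJ_PER_MOL_K * (temp_celsius + 273.15) * math.log(k_eq)


# CS with citrate (Table 2): kcat = 0.931 s^-1, Km = 0.00864 mM
cs_citrate_efficiency = catalytic_efficiency(0.931, 0.00864)

# MDH with oxaloacetate: kcat/Km = 43,300 s^-1 mM^-1, Km = 20.1 uM = 0.0201 mM
mdh_kcat = kcat_from_efficiency(43_300, 0.0201)

# Coupled CS + MDH reaction: K = 0.046 at 50 degrees C
coupled_dg = standard_free_energy_kj(0.046, 50.0)

print(f"CS kcat/Km (citrate): {cs_citrate_efficiency:.0f} s^-1 mM^-1")
print(f"Implied MDH kcat: {mdh_kcat:.0f} s^-1")
print(f"Coupled reaction dG: {coupled_dg:+.1f} kJ/mol")
```

Under these assumptions the CS value rounds to the 108 s⁻¹ mM⁻¹ given in Table 2, and the coupled reaction comes out mildly endergonic, which is in line with an unfavorable citrate cleavage step partly offset by the exergonic reduction of oxaloacetate.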
Accordingly, we conclude that T. takaii constitutively expresses and utilizes a reversible TCA cycle dependent on a bidirectional function of CS as its main carbon fixation pathway. Energetic and kinetic considerations theoretically support this possibility, given a sufficient energy supply via electron bifurcation. The proposed rTCA cycle in T. takaii includes three endergonic reactions: the ATP-independent citrate cleavage reaction (the free energy change under standard conditions ΔG° = +37.6 kJ/mol) (16) and the oxoglutarate synthase (OGOR with reduced ferredoxin) (ΔG° = +19 kJ/mol) and pyruvate synthase (POR with reduced ferredoxin) (ΔG° = +19 kJ/mol) reactions (17) (table S5). The overall reductive conversion of succinate to oxaloacetate via 2-oxoglutarate, isocitrate, and citrate is endergonic (ΔG° = +55 kJ). The reductive conversion from oxaloacetate to succinate is exergonic; ΔG° = −95 kJ when fumarate is reduced by NADH (table S5). The conversion should become even more favorable when reduced ferredoxin is used for fumarate reduction. Several reducing reactions in anaerobic organisms are driven by electron donors with low redox potential, such as reduced ferredoxins. The supply of electron donors with low redox potential, critical in sustaining the driving force of the reaction in the reducing direction, is made possible by flavin-based electron bifurcation (17, 18). Therefore, sufficient depletion of both acetyl-CoA and oxaloacetate and an abundant supply of reducing equivalents via electron bifurcation should allow the cycle to overcome the energetically unfavorable citrate-cleavage reaction of CS. The obligate hydrogenotrophic chemolithotrophy of T. takaii may serve to ensure the sufficient supply of reduced ferredoxin for POR, OGOR, and possibly fumarate reductase. By contrast, when T. takaii is grown chemolithomixotrophically with acetate, isotopologue analysis indicated that 2-oxoglutarate is generated solely by oxidative conversion from acetyl-CoA and oxaloacetate, driven by the exergonic CS reaction (fig. S5). When T. takaii is grown chemolithomixotrophically with succinate, isotopologue analysis indicated that a major portion of the oxaloacetate is generated by oxidative conversion of succinate. A proton motive force 

Table 2. Kinetic parameters of citrate synthase (CS) from T. takaii. 


Substrate; Vmax (µmol min⁻¹ mg⁻¹); kcat (s⁻¹); Km (mM); kcat/Km (s⁻¹ mM⁻¹) 
ACE |.COA TO =.) 250 FD 0.0563 + 0.0074 uAA4O 
PBKO) ae: lil 0.0412 + 0.0065 


Citrate; 1.11 ± 0.02; 0.931 ± 0.017; 0.00864 ± 0.00116; 108 


*Taken from the data examining acetyl-CoA. The maximum initial velocity calculated from the data in the presence of 200 µM acetyl-CoA was Vmax = 239 ± 12 (µmol min⁻¹ mg⁻¹), corresponding to kcat = 200 ± 10 (s⁻¹). 
Table 3. Kinetic parameters of citrate synthase (CS) for citrate and malate dehydrogenase (MDH) for oxaloacetate from T. takaii, and comparison with those of ATP-citrate lyases (ACLs). 

Source organism; Vmax (citrate) or specific activity (µmol min⁻¹ mg⁻¹); kcat (s⁻¹); Km (citrate) (mM); kcat/Km (s⁻¹ mM⁻¹); Refs. 

CS, Thermosulfidibacter takaii; 1.11 ± 0.02; 0.931 ± 0.017*; 0.00864 ± 0.00116; 108* 

MDH, Thermosulfidibacter takaii 

*Per monomer 


maintained by hydrogen oxidation coupled 
with S° reduction should be necessary for the 
endergonic oxidation of succinate in these cells 
(fig. S5), as in the case of Desulfuromonas 
acetioxidans (19). 
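
The energetic argument above, that depleting acetyl-CoA and oxaloacetate lets the cycle run against an unfavorable standard free energy, follows from the mass-action relation ΔG = ΔG° + RT ln Q. The sketch below is a rough illustration under stated assumptions, not a calculation from the paper. It applies the quoted ΔG° of +37.6 kJ/mol at the 70°C assay temperature (standard values are usually tabulated near 25°C, so this is an approximation), takes Q = [acetyl-CoA][oxaloacetate]/([citrate][CoA]), and sums the two quoted half-cycle values (+55 kJ and −95 kJ) to get a net figure that the text itself does not state. All names are hypothetical.

```python
import math

R_KJ_PER_MOL_K = 8.314e-3  # molar gas constant in kJ mol^-1 K^-1

# Values quoted in the text (kJ or kJ/mol)
DG0_CITRATE_CLEAVAGE = 37.6     # ATP-independent citrate cleavage by CS
DG0_SUCCINATE_TO_OAA = 55.0     # reductive branch via 2-oxoglutarate and citrate
DG0_OAA_TO_SUCCINATE = -95.0    # reductive branch with NADH-dependent fumarate reduction


def actual_free_energy(dg0_kj: float, quotient: float, temp_celsius: float) -> float:
    """Free energy change at reaction quotient Q: dG = dG0 + RT ln Q."""
    return dg0_kj + R_KJ_PER_MOL_K * (temp_celsius + 273.15) * math.log(quotient)


def max_quotient_for_forward_flux(dg0_kj: float, temp_celsius: float) -> float:
    """Largest Q at which the reaction is still not endergonic (dG <= 0)."""
    return math.exp(-dg0_kj / (R_KJ_PER_MOL_K * (temp_celsius + 273.15)))


# Simple sum of the two quoted half-cycle values (an illustrative assumption)
net_cycle_dg0 = DG0_SUCCINATE_TO_OAA + DG0_OAA_TO_SUCCINATE

# Threshold on [acetyl-CoA][oxaloacetate] / ([citrate][CoA]) at 70 degrees C
q_threshold = max_quotient_for_forward_flux(DG0_CITRATE_CLEAVAGE, 70.0)

print(f"Net of the two half-cycles: {net_cycle_dg0:+.0f} kJ")
print(f"Citrate cleavage proceeds only if Q < {q_threshold:.2e}")
```

On these assumptions the product-to-substrate quotient must stay in the range of 10⁻⁶ for citrate cleavage to proceed, which shows why low steady-state levels of acetyl-CoA and oxaloacetate matter.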

The rTCA cycle is believed to be one of the most ancient carbon fixation pathways on Earth. Our results suggest that the ancient rTCA cycle in early life on Earth did not necessarily require an ACL or CCS/CCL and originated from a bidirectional CS-dependent TCA cycle as identified in T. takaii. Identification of this previously unknown reversible TCA cycle and its functional regulation by the availability of carbon sources provides new insights that will help us understand the evolution of carbon fixation and central metabolisms in the earliest organisms on Earth. There is a long history of debate over heterotrophic versus autotrophic origin-of-life scenarios (20). Recently, abiotic synthesis of rTCA cycle components was observed in the presence of sulfate radicals (21). The directions of the individual conversions depended on the reaction, as there was no external energy input. When energy supplies are sufficient, the metabolic commitment toward autotrophy and heterotrophy driven by the primordial TCA cycle in T. takaii is flexible and would be controlled by environmental conditions such as availability of carbon sources. It has been pointed out that the core anabolism is universal at the ecosystem level, and the configuration of autotrophy or heterotrophy is an ecological distinction (22). The reversible TCA cycle, as seen in T. takaii, would meet these anabolic requirements and serve both autotrophic and heterotrophic lifestyles. The versatile TCA cycle would allow the earliest possible chemolithotrophic forms of life, with their limited metabolic network and catalytic enzyme (gene) resources, to make the most of what is available in their environment, such as in deep-sea hydrothermal vents. Thus, ancestral forms of life may have originated in the form of facultative autotrophs or mixotrophs. Further investigations into the possibilities of a facultative chemolithomixotrophic origin of life will provide key insights into proposing more likely scenarios for the origin of life on Earth. 
CARBON FIXATION 


Reversibility of citrate synthase 
allows autotrophic growth of 
a thermophilic bacterium 


Achim Mall, Jessica Sobotta, Claudia Huber, Carolin Tschirner, 
Stefanie Kowarschik, Katarina Bačnik, Mario Mergelsberg, Matthias Boll, 
Michael Hügler, Wolfgang Eisenreich, Ivan A. Berg 


Biological inorganic carbon fixation proceeds through a number of fundamentally 
different autotrophic pathways that are defined by specific key enzymatic reactions. 
Detection of the enzymatic genes in (meta)genomes is widely used to estimate the 
contribution of individual organisms or communities to primary production. Here we 
show that the sulfur-reducing anaerobic deltaproteobacterium Desulfurella acetivorans 
is capable of both acetate oxidation and autotrophic carbon fixation, with the 
tricarboxylic acid cycle operating either in the oxidative or reductive direction, 
respectively. Under autotrophic conditions, the enzyme citrate synthase cleaves citrate 
adenosine triphosphate independently into acetyl coenzyme A and oxaloacetate, a 
reaction that has been regarded as impossible under physiological conditions. Because 
this overlooked, energetically efficient carbon fixation pathway lacks key enzymes, it 
may function unnoticed in many organisms, making bioinformatical predictions difficult, if not impossible. 


Most organic carbon on Earth derives from biological CO₂ fixation. Six different autotrophic pathways responsible for this process are known today (1-3). Each of these pathways is characterized by certain key enzymes, the genes for which can be confidently recognized in (meta)genomic databases, thus allowing an estimation of the autotrophic potential reaching from single species up to entire ecosystems. The reductive tricarboxylic acid cycle (rTCA cycle, or Arnon-Buchanan cycle) is among the most ancient metabolic processes (4, 5), probably emerged from a nonenzymatic precursor (6), and is present in bacteria belonging to various phylogenetic groups (1). It is a reversal of the oxidative tricarboxylic acid (oTCA) cycle, which provides redox equivalents and energy in organisms with a respiratory metabolism. The key step of the oTCA cycle is citrate synthesis from acetyl coenzyme A (CoA) and oxaloacetate, catalyzed by citrate synthase (CS). This reaction is regarded as one of the irreversible steps in the oTCA cycle (1, 2, 7). The current consensus is that in organisms using the rTCA cycle, CS is substituted either by a reversible adenosine triphosphate (ATP)-dependent citrate lyase (ACL) (8, 9) or by homologous enzymes catalyzing the same reaction in two steps, citryl-CoA synthesis and cleavage (fig. S1) (10, 11). The presence of ACL in a bacterium is therefore regarded as a key indication for autotrophic CO₂ fixation via the rTCA cycle (12). Our study challenges this concept by demonstrating, through in vivo and in vitro experiments, that citrate cleavage in the autotrophic rTCA cycle can be catalyzed by CS, which was thought to function only in the oxidative direction. 

Desulfurella acetivorans is a sulfur-reducing thermophilic (growth optimum at 52° to 57°C) deltaproteobacterium capable of growing either heterotrophically with acetate as an electron donor and carbon source (13) or autotrophically with molecular hydrogen (14). In the heterotrophic route, acetate is oxidized via the oTCA cycle using CS and 2-oxoglutarate:ferredoxin oxidoreductase (15). Because the genome of D. acetivorans encodes neither ACL nor full sets of key enzymes for other known autotrophic pathways (table S1), the pathway of CO₂ fixation could not be assigned yet. The genome contains pseudogenes for ACL with frame shifts in both the α and β subunits (region 484760-487933), which render the enzyme nonfunctional. Amplification of the D. acetivorans DNA fragment containing these pseudogenes and its sequencing (16) confirmed that acl was interrupted (Fig. 1A). Furthermore, the genome contains a gene for 4-hydroxybutyryl-CoA dehydratase, the characteristic enzyme of the archaeal 3-hydroxypropionate/4-hydroxybutyrate and dicarboxylate/4-hydroxybutyrate cycles (17-19), but genes for other specific enzymes of these cycles are missing (table S1). Because the absence of key enzyme(s) of known CO₂ fixation pathways in an organism capable of autotrophic growth can be regarded as an indication of a novel pathway, we decided to study CO₂ fixation in D. acetivorans in more detail. 
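
The problem this poses for genome-based prediction can be made concrete with a toy example. The sketch below is purely illustrative and does not reproduce any real annotation pipeline. It contrasts a marker-based rule ("rTCA cycle present if ACL is encoded") with a check for the enzymes that a CS-dependent reductive route would need. The enzyme labels, the `ANNOTATED_ENZYMES` dictionary, and both function names are hypothetical. The D. acetivorans entry only reflects what the text states (no functional ACL; CS, MDH, and 2-oxoglutarate:ferredoxin oxidoreductase present), and the second genome is an invented comparison case.

```python
from typing import Dict, Set

# Hypothetical, simplified enzyme inventories (labels are not real locus tags)
ANNOTATED_ENZYMES: Dict[str, Set[str]] = {
    # Per the text: ACL is a pseudogene, so it is treated as absent
    "D. acetivorans": {"CS", "MDH", "OGOR"},
    # Invented comparison genome that carries an intact ACL
    "hypothetical ACL-positive strain": {"ACL", "MDH", "OGOR"},
}


def predicts_rtca_by_acl_marker(enzymes: Set[str]) -> bool:
    """Classical marker rule: call the rTCA cycle only if ACL is encoded."""
    return "ACL" in enzymes


def cs_dependent_route_possible(enzymes: Set[str]) -> bool:
    """Check for the enzymes a CS-dependent reductive route would require."""
    return {"CS", "MDH", "OGOR"}.issubset(enzymes)


for name, enzymes in ANNOTATED_ENZYMES.items():
    print(
        f"{name}: ACL marker rule = {predicts_rtca_by_acl_marker(enzymes)}, "
        f"CS-dependent route possible = {cs_dependent_route_possible(enzymes)}"
    )
```

The marker rule classifies the ACL-lacking genome as non-autotrophic, whereas the second check cannot distinguish an autotroph from an ordinary oTCA-cycle heterotroph, because the same enzymes serve both directions. That ambiguity is the difficulty the abstract points to.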

In accord with the published data, D. acetivorans grew both heterotrophically with acetate/CO₂ and autotrophically on H₂/CO₂ with minimal generation times of 5.7 and 4.9 hours to a density of 1.5 x 10° and 3.0 x 10° cells per ml, respectively. High activities of oTCA cycle enzymes were detected in extracts of cells grown under both conditions, with malate dehydrogenase (MDH) and CS activities being extremely high (Table 1). In contrast, no activities of key enzymes of known pathways of autotrophic CO₂ fixation were detected (table S1). By ultra performance liquid chromatography (UPLC) analysis, we determined a citrate (20 mM)-, CoA (1 mM)-, and NADH (reduced form of nicotinamide adenine dinucleotide) (5 mM)-dependent formation of acetyl-CoA (Fig. 1B and fig. S2). Furthermore, a CoA- and citrate-dependent oxidation of NADH was observed that could be attributed to oxaloacetate formation followed by NADH-dependent reduction to malate by endogenous MDH (Table 1). However, none of these reactions were dependent on the presence of ATP, as would be expected if ACL or citryl-CoA synthetase/citryl-CoA lyase were operating. 

To further elucidate the fate of citrate, we 
incubated cell extracts of D. acetivorans un- 
der anaerobic conditions in buffer containing 
[U-C,]citrate in the presence of NADH and 
CoA. After 10 min of incubation, the reaction 
was stopped and subjected to °C nuclear mag- 
netic resonance (NMR) analysis. Though the 
specific °C NMR signals for [U-Cg]citrate 
could still be detected, we also observed six °C 
NMR multiplets as a result of product forma- 
tion (Fig. 2 and table $2). On the basis of the 
chemical shifts, the C-"°C coupling constants, 
and titration experiments, these signals were 
unequivocally attributed to [U-'C,]Jmalate and 
[U-C.]acetyl-CoA. Together with the UPLC 
data, these results provide firm evidence for 
the cleavage of citrate to acetyl-CoA and oxalo- 
acetate, with the latter being further reduced 
to malate by MDH, which was highly active 
in our cell extracts (Table 1). The only possible 
enzyme in D. acettvorans that can catalyze an 
ATP-independent citrate cleavage to oxaloace- 
tate and acetyl-CoA is CS. However, with a free- 
energy difference AG of -35.8 kJ mol in the 
canonical direction of citrate formation [at 
pH 7, ionic strength 0.25, and 38°C (20)], the 
cleavage reaction has been regarded as im- 
possible under physiological conditions. Even 
though the next reaction of the rTCA cycle— 
oxaloacetate reduction to malate (catalyzed by 
MDH)-—is highly exergonic [AG of-27.1 kJ mol™ 
at pH 7, ionic strength 0.25, and 38°C (20)], 
the reversal of the CS reaction would require 
high substrate and low product concentrations. 
Using liquid chromatography-mass spectrometry 
(LC-MS), we found that the CoA/acetyl-CoA 
ratio in autotrophically grown D. acetivorans 
cells (93, table S3) was much higher than, for 
instance, in glucose-grown Escherichia coli
[2.3 (21)], whereas the citrate concentration
in D. acetivorans was typical for bacterial cells
[1.4 mM compared to 2 mM in E. coli (21)].
Using these concentrations of metabolites, we
calculated an equilibrium oxaloacetate concentration
of 0.13 µM. Although it is low, this
concentration is still in a physiological range,
similar to, for example, the mitochondrial oxaloacetate
concentration, which is in the low
micromolar range (22, 23). [U-¹³C₄]malate
and [U-¹³C₂]acetyl-CoA could also be found
when [U-¹³C₆]citrate, CoA, and NADH were incubated
with commercially available CS and
MDH (from porcine heart, Sigma) (Fig. 1C and
fig. S3), providing further evidence that the conversion
of citrate in cell extracts is catalyzed by
CS. The Michaelis constant (Kₘ) values of CS and
MDH measured in D. acetivorans cell extracts were
close to those for the porcine enzymes (table S4).
Apparently, the CS of D. acetivorans is not specifically
adapted to citrate cleavage. This is also
apparent from the high Kₘ value for citrate, thus
requiring the observed high specific activity of the
enzyme.

Fig. 1. ACL-independent citrate cleavage in D. acetivorans. (A) ACL pseudogene in the D. acetivorans
genome, as compared with the corresponding fragment of the genome of a closely related species,
D. multipotens, which possesses an intact acl gene. Numbers indicate the pairwise nucleotide
identity. bp, base pairs. (B and C) UPLC analysis of the CoA and CoA esters formed from citrate
in the reaction catalyzed by D. acetivorans cell extracts (B) and by CS and MDH from porcine
heart (Sigma) (C). A260, absorbance at 260 nm.
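The equilibrium estimate for oxaloacetate can be retraced with a few lines of arithmetic. The sketch below is only an illustration, not the authors' calculation: it assumes the relation K = exp(−ΔG/RT) for citrate formation (acetyl-CoA + oxaloacetate → citrate + CoA), a citrate concentration of 1.4 mM, the reported CoA/acetyl-CoA ratio of 93, and 38°C. The function names are hypothetical.

```python
import math

R_KJ_PER_MOL_K = 8.314462618e-3  # molar gas constant in kJ mol^-1 K^-1


def equilibrium_constant(delta_g_kj_per_mol: float, temp_celsius: float) -> float:
    """Return K = exp(-dG / RT) for a reaction with free-energy change dG."""
    temp_kelvin = temp_celsius + 273.15
    return math.exp(-delta_g_kj_per_mol / (R_KJ_PER_MOL_K * temp_kelvin))


def equilibrium_oxaloacetate(citrate_molar: float,
                             coa_to_acetyl_coa: float,
                             k_citrate_formation: float) -> float:
    """Solve K = [citrate][CoA] / ([acetyl-CoA][oxaloacetate]) for oxaloacetate."""
    return citrate_molar * coa_to_acetyl_coa / k_citrate_formation


# Values taken from the text; 1.4 mM citrate is an assumption of this sketch.
k_cs = equilibrium_constant(-35.8, 38.0)
oaa_micromolar = equilibrium_oxaloacetate(1.4e-3, 93.0, k_cs) * 1e6

print(f"K (citrate formation) = {k_cs:.3g}")
print(f"equilibrium oxaloacetate = {oaa_micromolar:.2f} uM")
```

Under these assumptions the result lands close to the 0.13 µM quoted in the text, which illustrates how a high CoA/acetyl-CoA ratio partly offsets the unfavorable equilibrium constant.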

In addition to citrate cleavage, we were able 
to detect activity of another characteristic en- 
zyme of the rTCA cycle, fumarate reductase, 
in D. acetivorans cell extracts. This activity 
was measured as fumarate-dependent succinyl- 
CoA formation in an assay coupled with en- 
dogenous succinyl-CoA synthetase (Table 1). 
The identity of the product, succinyl-CoA, was 
confirmed by LC-MS analysis (fig. S4). The 
highest specific activity (9 nmol min⁻¹ mg⁻¹
protein) was observed with dithionite as the 
electron donor; lower activities could be de- 
tected when NADPH (reduced form of nico- 
tinamide adenine dinucleotide phosphate) 
or NADH were used. No activity could be de- 
tected with reduced forms of benzyl viologen, 
methyl viologen, the menaquinone analogon 
2,3-dimethyl-1,4-naphthoquinone, or flavin mo- 
nonucleotide as electron donors. We assume that 
this reaction depends on a different, unidentified
electron donor with low redox potential in vivo,
possibly coupling fumarate reduction by H₂ with
the buildup of a proton-motive force, as was discussed
for the deltaproteobacterium Desulfuromonas
acetoxidans (24).

To test the function of the rTCA cycle in 
D. acetivorans in vivo, we grew autotrophic cultures
in the presence of 0.2 mM [1-¹³C]pyruvate,
which was added in four 0.05 mM portions
during the exponential growth phase. The cells
were harvested and hydrolyzed under acidic
conditions. Four of the obtained amino acids
(Ala, Asp, Glu, and Pro) were then converted into
tert-butyl-dimethylsilyl derivatives, followed by
gas chromatography-mass spectrometry analysis
to determine ¹³C enrichments and positions
in the molecules (table S5). Alanine, reflecting
its precursor pyruvate, displayed ~6% ¹³C excess.
Incorporation into aspartate, reflecting its precursor
oxaloacetate, and into glutamate and
proline, reflecting the precursor 2-oxoglutarate,
was lower (1 to 2% ¹³C excess) but still significant.
A more detailed MS analysis of masses
comprising all carbon atoms of the original 
amino acids (i.e., Ala, Asp, Glu, and Pro at a 
mass/charge ratio of 260, 418, 432, and 286, 
respectively, with fragments having lost one 
or two carbon atoms from the respective pre- 
cursor amino acids) revealed some positional 
assignment of the ¹³C label (fig. S5). Thus,
comparison of ¹³C excess in Ala²⁶⁰ with that in
Ala²³² (still carrying C-2 and C-3 of the original
Ala molecule) immediately revealed that
the ¹³C label in Ala must have been entirely located
at C-1, which is lost in the fragment, as expected
from its origin from the [1-¹³C]pyruvate
precursor. The apparently identical ¹³C enrichments
in Glu⁴³² and Pro²⁸⁶ and in their respective
smaller fragments suggested that the ¹³C label is
located in positions 2 to 5 in glutamate and proline. The lower ¹³C
excess in Asp³⁹⁰ (carrying C-2, C-3, and C-4 of
Asp) in comparison with Asp⁴¹⁸ (comprising all
carbon atoms of Asp) indicated that the label must
be present in C-1 of Asp. As shown in fig. S6, these
¹³C distributions can be predicted by shuffling the
[1-¹³C]pyruvate tracer via phosphoenolpyruvate
(PEP) carboxylation into [1-¹³C]oxaloacetate, which
enters the rTCA cycle and is converted into 2-oxoglutarate
via the symmetric intermediates fumarate
and succinate (fig. S6A). On the other
hand, pyruvate assimilation via [1-¹³C]oxaloacetate
followed by reactions of the oTCA cycle could not
explain ¹³C incorporation into Glu and Pro, as the
¹³C label becomes lost during the conversion of
isocitrate into 2-oxoglutarate (table S6 and figs. S5
and S6B). Together, these results clearly proved the
existence of a functional rTCA cycle under in vivo 
conditions. 

The operation of the rTCA cycle was also 
confirmed by inhibitor analysis of autotroph- 
ically growing D. acetivorans cultures. Two in- 
hibitors of the cycle, fluoroacetate [which is 
converted through the CS reaction to the acon- 
itase inhibitor fluorocitrate in the cell (25)] 
and glyoxylate [pyruvate:ferredoxin oxidore- 
ductase inhibitor (26)], suppressed growth of 
D. acetivorans (fig. S7).

Table 1. Enzymes of central carbon metabolism in D. acetivorans. Activities were measured at 55°C. The number of biological repetitions (n) is indicated.
ND, not determined; ADP, adenosine diphosphate; acc., accession number.

Pathway/enzyme | Specific activity [µmol min⁻¹ mg⁻¹ protein] ± SD, H₂ + CO₂ | Specific activity [µmol min⁻¹ mg⁻¹ protein] ± SD, Acetate + CO₂ | Candidate gene(s), GenBank acc.

Succinate dehydrogenase

18.4406 (n = 3)

AHF96923, AHF96924, AHF96945,

138.1 ± 11.6 (n = 3)

Citrate synthase, reverse (NADH oxidation)

1.86 ± 0.48 (n = 3)

AHF96882, AHF96883, AHF97110,
AHF97111, AHF97112, AHF97113

Acetyl-CoA synthetase/acetate kinase +
phosphate acetyltransferase

Pyruvate synthase

AHF96645, AHF96951, AHF97587,
AHF96643, AHF96644

PEP synthase

Pyruvate carboxylase

We designate this novel CS-dependent ver- 
sion of the rTCA cycle as “reversed oTCA cycle” 
(roTCA). The functioning of the roTCA cycle 
requires reduced ferredoxin, which is probably 
synthesized through electron bifurcation. In- 
deed, the D. acetivorans genome contains genes 
for a NAD-dependent ferredoxin:NADPH oxido- 
reductase (NfnAB) (27). Furthermore, the direct 
reduction of ferredoxin in a hydrogenase reac- 
tion is not implausible during growth at high 
H₂ concentration (80% in our experiments).

The activities of the enzymes catalyzing the 
key reactions of the TCA cycle (Table 1 and 
supplementary text) were nearly the same for 
D. acetivorans grown organoheterotrophically 
in medium with acetate or lithoautotrophically 
with hydrogen. However, the same set of TCA 
cycle enzymes has to function in either the oxi- 
dative or reductive direction, depending on the 
growth conditions (Fig. 3, A and B). The question 
then arises as to how the regulation of carbon 
flux can occur. Two obvious possibilities are that 
either the presence of H₂ shifts the flux toward
the roTCA cycle or that exogenous acetate drives 
the TCA cycle in the oxidative direction. Iso- 
topologue profiling of Desulfurella metabolism 
during growth with or without H₂ in the presence
of CO₂ and [U-¹³C₂]acetate revealed that acetate
is mostly being oxidized under both conditions 
(tables S7 and S8 and figs. S8 and S9). These 
data disprove the regulatory effect of H₂ and suggest
that the presence of acetate itself leads to
increased intracellular acetyl-CoA concentration, 
triggering the direction of the cycle. The detected 
high activities of CS and MDH (Table 1) are not 
required for growth under heterotrophic condi- 
tions. However, the high activities allow fast switch- 
ing from heterotrophic to autotrophic growth and 
reflect the adaptation of D. acetivorans to fluctuat- 
ing acetate concentrations characteristic for the 
D. acetivorans natural environment, where both ac- 
tive acetate formation and oxidation take place (28). 

What makes this newly discovered variant 
of the rTCA cycle so special? First, the roTCA 
cycle requires one less ATP molecule per synthesized
acetyl-CoA compared with the classical
ACL-dependent cycle (1-3). In bioenergetic
terms, the roTCA cycle seems to be the most
efficient pathway of autotrophic CO₂ fixation
known today (Fig. 3C). Still, the modified path- 
way is associated with an additional energy 
demand, as an organism using it must in- 
crease the amount of some catalysts to cope 
with their low substrate concentrations [note 
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2) AHF97619 


0.014 + 0.005 (n = 3) AHF96546 


the very high activities of CS and MDH (Table 
1)]. The high activities of the enzymes involved 
in thermodynamically unfavorable reactions 
ensure efficient coupling of the roTCA cycle 
to biosynthetic reactions. Anabolism is highly 
exergonic with a thermodynamic efficiency 
of ~40%, thus providing an additional driv- 
ing force for the autotrophic pathway. Never- 
theless, the required low product-to-substrate 
ratio and high enzymatic activities may ren- 
der this strategy infeasible for some organisms 
or under certain growth conditions. Second, 
the roTCA cycle can hardly be recognized bio- 
informatically. It makes (meta)genome-based 
bioinformatic predictions of the autotrophic 
potential of an anaerobic organism (or micro- 
bial community) difficult, if not impossible. 
Third, many bacteria using the rTCA cycle to 
fix CO₂ also possess CS genes (29, 30), and conversion
of the ACL version of the rTCA cycle
into the roTCA cycle appears to be quite easy. 
Notably, another Desulfurella species, D. multi- 
potens, possesses an intact acl gene (Fig. 1A) 
but apparently uses the roTCA cycle, as can 
be judged from the ACL and CS activity mea- 
surements and ATP independence of citrate 
cleavage (table S9). The usage of the roTCA cycle
could be a widespread trait of anaerobic autotrophic
organisms. Considering a wide distribution
of its enzymes in anaerobes, the cycle
could have evolved convergently in different microbial
groups. Fourth, CS is structurally simpler
than ACL, which consists of several domains,
with one of them being homologous to CS (3).
Therefore, the roTCA cycle may even predate the
modern ACL-rTCA cycle, with an ancestral TCA
cycle being fully reversible.

Fig. 2. ¹³C NMR analysis of [U-¹³C₆]citrate cleavage catalyzed by cell extracts
of autotrophically grown D. acetivorans. The reaction mixture contained
2 mM NADH, 2 mM CoA, and 2 mM citrate. (A) Reaction after 0 min.
(B) Reaction after 10 min of incubation. (C) Sample as in (B), plus 5 µg of
[U-¹³C₄]malate reference. Signals and couplings of [U-¹³C₆]citrate,
[U-¹³C₄]malate, and [U-¹³C₂]acetyl-CoA are indicated. For numerical
values of chemical shifts and couplings, see table S2. Note that
citrate is a heat-stable compound and did not degrade nonenzymatically.
ppm, parts per million.

Fig. 3. A reversible TCA cycle in D. acetivorans and its efficiency in
comparison with other autotrophic CO₂ fixation pathways.
(A) roTCA cycle during growth on CO₂, H₂, and elementary sulfur (S⁰).
(B) oTCA cycle during growth on acetate and S⁰. (C) ATP costs and
estimated ΔG′° values for the synthesis of acetyl-CoA via the known
autotrophic CO₂ fixation pathways (red, anaerobic; brown, microaerobic;
blue, aerobic; see table S10 for details). Note that comparison of the
pathways by their ATP costs does not take into account the costs for
ferredoxin reduction and thus overestimates the energetic efficiency of
anaerobic pathways.
synthase; 2, malate dehydrogenase; 3, fumarase; 4, fumarate reductase (roTCA cycle) or succinate dehydrogenase (oTCA cycle); 5, succinyl-CoA synthetase; 6,
2-oxoglutarate synthase; 7, isocitrate dehydrogenase; 8, aconitase; 9, pyruvate synthase; 10, pyruvate carboxylase; 11, acetate:succinyl-CoA CoA-transferase;
12, acetyl-CoA synthetase/acetate kinase + phosphate acetyltransferase.

Mall et al., Science 359, 563-567 (2018)

2 February 2018

Our results show that unexpected discoveries 
are possible, even when studying well-known 
metabolic pathways. If the highly endergonic 
citrate synthase reaction can be reversed in vivo, 
which other apparently unidirectional metabolic 
reactions may be reversed under certain condi- 
tions? Despite the many advances of the omics 
era, these features cannot be identified bioin- 
formatically but require classical biochemical 
studies for their discovery.

High-energy, high-fat lifestyle challenges an Arctic apex predator, the polar bear

A. M. Pagano, G. M. Durner, K. D. Rode, T. C. Atwood, S. N. Atkinson,
E. Peacock, D. P. Costa, M. A. Owen, T. M. Williams


Regional declines in polar bear (Ursus maritimus) populations have been attributed to 
changing sea ice conditions, but with limited information on the causative mechanisms. 
By simultaneously measuring field metabolic rates, daily activity patterns, body 
condition, and foraging success of polar bears moving on the spring sea ice, we found 
that high metabolic rates (1.6 times greater than previously assumed) coupled with 
low intake of fat-rich marine mammal prey resulted in an energy deficit for more than 
half of the bears examined. Activity and movement on the sea ice strongly influenced 
metabolic demands. Consequently, increases in mobility resulting from ongoing and 
forecasted declines in and fragmentation of sea ice are likely to increase energy 
demands and may be an important factor explaining observed declines in body
condition and survival.


As the most carnivorous and only marine-living
ursid, polar bears are alone among
bears in their reliance on marine mammal
prey. Evolution of this Arctic apex predator
included behavioral and physiological
adaptations that distinguish polar bears
from terrestrial bears (1), which has made them
dependent on the sea ice and may increase their 
vulnerability to climate change (2). As a con- 
sequence of living in this labile marine hab- 
itat, polar bears occupy expansive home ranges 
that are considerably larger than those occupied 
by other ursids (3) or predicted for similarly 
sized terrestrial carnivores (4). They also ex- 
hibit remarkable abilities to swim for extended 
distances (5). However, such long-distance move- 
ments, whether walking or swimming, neces- 
sitate substantial energetic resources to satisfy 
locomotor demands (6, 7). Historically, sufficient 
resources were afforded through the availability 
of fatty, energy-dense seal prey, which could be 
hunted efficiently from the sea ice (8). Presently, 
the sea ice minimum extent across the Arctic is 
decreasing at a rate of 14% per decade, spring 
break-up is occurring earlier, and fall freeze-up 
is occurring later (9). This decline in sea ice is 
likely reducing access to, and abundance of, seal 
prey (10). 

Because metabolism determines the rate at 
which organisms require energy from their environment
(11), measures of polar bear metabolic
rates provide an important metric for
linking declines in Arctic sea ice to polar bear 
survival. Changes in energy balance resulting
from increased energy expenditure or
reductions in foraging opportunities could 
lead to declines in body condition, survival, 
and reproductive success. Previous studies 
have reported that walking costs for polar 
bears are greater than predicted for other 
similarly sized mammals (6, 12). Yet, models 
predicting polar bear annual field metabolic 
rates (FMRs) assumed that FMRs would be 
relatively low because of the bear’s sit-and- 
wait style of hunting and theorized ability to 
reduce metabolism while fasting (13, 14), sim- 
ilar to that of denning bears (13, 15). However, 
measures of mass loss and body temperature 
suggest that metabolic rates of fasting polar 
bears during the summer are in fact greater 
than those of denning bears (16-18). Despite 
this improved understanding of fasting metabo- 
lism, no study has provided quantitative esti- 
mates of the FMRs of active polar bears on the 
sea ice where they spend most—and in many 
areas of their range, all—of the year. 

We measured the FMRs of female polar bears 
(n = 9) during April 2014-2016 in the Beaufort 
Sea (Fig. 1A) over 8 to 11 days each year using 
doubly labeled water (DLW) (supplementary 
materials) (19). On the same animals, we deployed 
global positioning system (GPS)-equipped video 
camera collars and archival loggers with tri- 
axial accelerometers and conductivity sensors 
to determine activity and behavior in order to 
assess the causes of variation in FMRs (Fig. 1) 
(19). We determined individual energetic ba- 
lance using video-derived observations of for- 
aging and measures of blood biochemistry, 
body mass, and body composition. Additionally, 
we measured the resting metabolic rate (RMR) 
of a captive adult female polar bear using open- 
flow respirometry to assess baseline energetic 
costs of the species relative to free-ranging polar 
bears and other mammals.

Like other members of the order Carnivora
whose diet is exclusively meat (20), polar bears
exhibit greater RMRs as compared with predictions
for other terrestrial mammals (derived from
omnivores, carnivores, and foliovores; RMR = 70 ×
mass^0.75) (Fig. 2) (21). RMRs measured from a
single captive polar bear over six sessions averaged
0.34 ml O₂ g⁻¹ h⁻¹ (SE = 0.01) with a low, post-absorptive
(fasting) value of 0.30 ml O₂ g⁻¹ h⁻¹,
which converts to 37.1 MJ day⁻¹. On a mass-specific
basis, this post-absorptive RMR was
within 0.5 to 11.4% of post-absorptive RMRs 
previously reported for subadult polar bears 
(22, 23), which is 17% greater than measure- 
ments from polar bears while denning (24) and 
2 to 21% greater than other ursids while resting 
or denning (Fig. 2) (19). Thus, carnivory and large 
body mass set a comparatively high maintenance 
cost for polar bears that must be satisfied to 
remain in energetic balance. 
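To make the comparison between measured and predicted RMR concrete, the sketch below evaluates the allometric prediction RMR = 70 × mass^0.75 and converts an oxygen consumption rate into MJ day⁻¹. Several inputs are assumptions of this illustration and are not stated in the text: the prediction is taken to be in kcal day⁻¹ with mass in kg, oxygen is converted at 20.1 J per ml, and the 250 kg body mass is purely hypothetical.

```python
KCAL_TO_MJ = 4.184e-3   # 1 kcal = 4.184 kJ
J_PER_ML_O2 = 20.1      # assumed energy equivalent of oxygen, J per ml O2


def predicted_rmr_mj_per_day(mass_kg: float) -> float:
    """Allometric prediction 70 * mass^0.75, assumed to be in kcal/day (mass in kg)."""
    return 70.0 * mass_kg ** 0.75 * KCAL_TO_MJ


def o2_rate_to_mj_per_day(ml_o2_per_g_per_h: float, mass_kg: float) -> float:
    """Convert a mass-specific O2 consumption rate into whole-animal MJ/day."""
    ml_o2_per_day = ml_o2_per_g_per_h * mass_kg * 1000.0 * 24.0
    return ml_o2_per_day * J_PER_ML_O2 / 1e6


mass_kg = 250.0  # hypothetical body mass, not taken from the text
measured = o2_rate_to_mj_per_day(0.30, mass_kg)   # post-absorptive rate from the text
predicted = predicted_rmr_mj_per_day(mass_kg)

print(f"measured  RMR: {measured:.1f} MJ/day")
print(f"predicted RMR: {predicted:.1f} MJ/day")
print(f"measured / predicted: {measured / predicted:.2f}")
```

With a different body mass or oxygen equivalent the numbers shift, so the output should be read as an order-of-magnitude check on the reported 37.1 MJ day⁻¹ rather than a reproduction of it.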

As a result, the FMR of polar bears is high 
relative to predictions for terrestrial mammals 
generally (25) and consistent with expected levels 
derived solely from other mammalian carnivores 
(26). Our measured FMRs averaged 1.6 times previously
assumed values for polar bears (SE = 0.1,
range = 1.0 to 2.6) (13, 14). Daily FMRs measured
over 8 to 11 days averaged 0.45 ml CO₂ g⁻¹ h⁻¹
(SE = 0.04, n = 9 bears), which converts to 51.6 MJ
day⁻¹ (SE = 6.2, n = 9 bears) or 12,324.7 kcal day⁻¹.
These values ranged from 0.6 to 1.1 times predicted
FMRs [mean (x̄) = 0.8, SE = 0.1] for similarly
sized marine and terrestrial mammalian carnivores
(26) and 2.5 to 5.2 times predicted RMRs based on
body mass (x̄ = 2.8, SE = 0.3) (Fig. 3A and table S1)
(21). Daily FMR was 1.5 to 2.8 times the post-absorptive
RMR (x̄ = 2.0, SE = 0.2) measured
in this study. On average, daily FMR was 2.6 times 
(17) and 4 times (27) predicted values for male 
polar bears fasting on land. 

To remain in energy balance with these ele- 
vated metabolic demands, polar bears have evolved 
hunting tactics to prey on high energy-content 
prey, such as ringed seals (Pusa hispida), and 
preferentially feed on their energy-dense blubber 
(13). Using our measures of daily FMR, we esti- 
mated that a solitary female bear on the spring 
sea ice would on average need to eat either one 
adult ringed seal, three subadult ringed seals, or 
19 newborn ringed seal pups every 10 to 12 days 
to remain in energetic balance (Fig. 4A). Our esti- 
mates corroborate observations that bears in the 
early summer typically kill an adult or subadult 
ringed seal every 5 days (8). However, this rate of 
consumption would be necessary simply to energetically
break even. Polar bears put on the majority
of their body fat in the late spring and early
summer (13) and can reach a relative fatness of
1 kg fat per kg lean body mass (28). To obtain 
this body condition, bears would either need to 
reduce their energy demands or increase their 
rate of food consumption. 
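The prey requirement above follows from multiplying the daily FMR by the interval between meals. The sketch below works through that arithmetic using only figures quoted in the text (51.6 MJ day⁻¹, a 10- to 12-day interval, and 1, 3, or 19 seals per interval). The per-seal energy values it prints are back-calculated for illustration, treat all prey energy as usable, and are not values reported by the authors.

```python
DAILY_FMR_MJ = 51.6             # mean daily FMR quoted in the text
MEAL_INTERVAL_DAYS = (10, 12)   # interval between meals quoted in the text
SEALS_PER_INTERVAL = {"adult": 1, "subadult": 3, "newborn pup": 19}


def energy_needed_mj(daily_fmr_mj: float, days: float) -> float:
    """Energy spent over the interval, which prey intake must replace."""
    return daily_fmr_mj * days


def implied_energy_per_prey_mj(daily_fmr_mj: float, days: float, count: int) -> float:
    """Energy each prey item must supply if `count` items cover the interval."""
    return energy_needed_mj(daily_fmr_mj, days) / count


for prey, count in SEALS_PER_INTERVAL.items():
    low, high = (implied_energy_per_prey_mj(DAILY_FMR_MJ, days, count)
                 for days in MEAL_INTERVAL_DAYS)
    print(f"{prey}: {low:.0f} to {high:.0f} MJ per seal")
```

Because fat gain requires a surplus, any intake below these break-even figures implies mass loss, which is the situation described for more than half of the study animals.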

Using video collar data, we documented bears’ 
hunting behavior and foraging success. Bears 
used sit-and-wait tactics to hunt seals 90% of 
the time, and stalking comprised the remaining
10% of hunts (movies S1 to S4) (19). Bears
that successfully killed and ate adult or subadult
ringed seals either gained or maintained
body mass, whereas bears that only scavenged 
or showed no evidence of eating lost mass (Fig. 4, 
B, C, and D). Land-based remains of subsistence- 
harvested bowhead whale carcasses (Balaena 
mysticetus) provided an alternative food resource 
for two bears (Fig. 4, B and D), although 91% of 
their telemetry data were on the sea ice and away 
from this resource. Scavenging the muscle from 
previously killed seals was also common (Fig. 4D) 
and consistent with previous observations of polar 
bear foraging on the sea ice (8, 13). 

Four bears lost ≥10% of their body mass over
the 8- to 11-day period (Fig. 4B and table S1), with
an average loss of 1% per day (1.95 kg day⁻¹). This
is 4 times the percent mass lost per day and
2.2 times the kilograms lost per day documented
in fasting polar bears on land (16), which typically
exhibit lower activity levels than those of
the bears in this study (18). Nevertheless, this
change in body mass is within the range of the 
percent mass loss observed in other carnivores 
(29, 30). Of these four bears, one lost a greater 
amount of lean body mass than fat mass (Fig. 4B 
and table S1), which suggests that she was in a 
prolonged fasting condition (16). This was confirmed
with changes in serum urea/creatinine
(U/C) ratios (an indicator of fasting for ≥7 days)
(table S1) (19, 31). Additionally, changes in U/C
ratios over the 8- to 11-day period were strongly
positively correlated with changes in body mass
[coefficient of determination (R²) = 0.62, P = 0.01,
n = 9 bears]. 

Despite the use of efficient sit-and-wait hunt- 
ing tactics, polar bears in the spring exhib- 
ited activity patterns typical of other large 
terrestrial carnivores (19). Activity rates (the 
percentage of time engaged in nonresting be- 
haviors) derived from accelerometers, which 
were recorded continuously throughout each 
day, ranged from 22 to 40% (x̄ = 34%, SE = 2.8,
n = 6 bears) (fig. S2C). Bears spent 28% of 
the time walking (SE = 2.3, n = 6 bears) and 
only 0.3% of the time swimming (SE = 0.15, 
n = 6 bears) (fig. S2C). Activity rates derived 
from video collars (which recorded only during 
daylight hours) ranged from 13 to 60% of 
the day (x̄ = 32%, SE = 5.1, n = 9 bears) (fig.
S2D). These activity rates are similar to those 
observed for adult and subadult polar bears 
on the summer sea ice (34.6% of the time 
active) (8) but are greater than observations 
from a breeding adult female polar bear on 
the spring sea ice (26.7% active) (32). Breeding 
females are known to exhibit reduced activity 
and devote less time to hunting as compared 
with that of nonbreeding individuals (32). In 
the present study, seven of the nine bears in- 
teracted with at least one adult male. How- 
ever, the activity levels we observed were greater 
than measures derived from activity sensors 
on adult female polar bears on the sea ice of 
the Beaufort Sea during April (25% active) (18) 
and solitary female polar bears on the sea ice 
in April (16.9% active) in the Canadian Arctic 
(33), indicating marked variability in the activity
levels of this species depending on seasonal,
geographical, and reproductive factors.
The activity levels for polar bears were less than
the activity levels documented in other bear
species (x̄ = 54% active) (34) but were similar to
activity levels reported for other large terrestrial
carnivores (x̄ = 39% active) (supplementary
materials) (19).

Ultimately, the ability of polar bears to achieve 
energy balance is dictated by the acquisition of 
metabolizable energy versus expenditure from 
basal metabolism, specific dynamic action, ther- 
moregulation, reproduction, growth, and locomo- 
tion (35). We found that variation in daily FMR 
was primarily influenced by positive relationships 
with body mass (daily FMR = 0.0002 × mass^2.41,
R² = 0.91, P < 0.001, n = 9 bears) (Fig. 3A),
movement rate (daily FMR = 167.3 × rate +
153.0, R² = 0.82, P < 0.001, n = 9 bears) (Fig. 3B),
and activity rate derived from video collars (daily
FMR = 336.73 × activity + 180.5, R² = 0.60, P =
0.01, n = 9 bears) (Fig. 3C). We found a positive
but nonsignificant relationship between activity
rate derived from accelerometers and daily FMR
(R² = 0.56, P = 0.09, n = 6 bears). We further
found a positive relationship between overall
FMR and total distance traveled (overall FMR =
0.006 × distance + 1.5, R² = 0.78, P = 0.003, n =
8 bears) (Fig. 3D). These relationships suggest that 
basal metabolism, locomotion, and activity were 
the primary drivers of energy expenditure for the 
polar bears in our study. This reinforces that 
there is a substantial cost of locomotion in polar 
bears relative to other quadrupedal mammals, as
has been documented in captive treadmill studies
(6). For example, bears with movement rates
averaging >1.0 km h⁻¹ (n = 3 bears) had 1.5 times
greater FMRs than those of bears that moved
<1.0 km h⁻¹ (n = 6 bears) (Fig. 3B).

Fig. 1. Field movements and accelerometer signatures of polar bears in April 2014 to 2016.
(A) Capture and recapture locations and GPS movement paths of nine female polar bears dosed with
DLW and equipped with GPS-equipped video camera collars and archival loggers with triaxial
accelerometers and conductivity sensors. Gray area denotes land; white area shows sea ice cover.
(Inset) Orientation of the accelerometer while attached to the video collar. (B) Accelerometer
signatures of static acceleration in the surge (x), heave (y), and sway (z) directions and overall
dynamic body acceleration (ODBA) while walking, swimming, and standing. Images show the
corresponding behaviors derived from the animal-borne video camera.

Fig. 2. Relationship between body mass and RMR in polar bears and
other ursids. RMR of an adult female polar bear (orange triangle, this study),
compared with mean RMRs of subadult polar bears (orange circle), hibernating
adult female polar bears (orange square), hibernating adult male brown
bears (blue square), black bears (black circle), hibernating black bears (black
square), adult and subadult panda bears (green circle), and female sloth
bears (purple circle). Sources are available in the supplementary materials.
The dashed line is the allometric regression for RMR in vertebrate-eating
carnivores (20). The solid line is the allometric regression for RMR in eutherian
mammals (21). (Inset) The adult female bear resting in the metabolic
chamber from the present study.

Fig. 3. Daily FMR and overall FMR in relation to body mass,
movement, and activity rate of polar bears. (A) Mean daily
FMRs of female polar bears on the sea ice in relation to body mass.
The allometric regression (solid line) is compared with predicted daily
FMRs for marine and terrestrial mammalian carnivores (upper
dashed line) (26), and predicted daily RMRs (lower dotted line) (21).
(B) Least squares regression (solid line) of mean daily mass-specific
FMR in comparison with mean movement rate. (C) Least squares
regression (solid line) of mean daily mass-specific FMR in comparison
with mean activity rate derived from video collars. (D) Least squares
regression (solid line) of overall mass-specific FMR in comparison
with total distance moved over 8 to 11 days. Regression statistics are
provided in the main text. Each point represents a single value for
one bear in (A) to (D).
Admittedly, the activity levels and FMRs in 
this study may be biased low owing to the effects 
of recovery post-capture. On the basis of move- 
ment rate and activity sensor data, recovery post- 
capture for polar bears may last 2 to 3 days (36). 
Although one bear in this study successfully caught
and ate an adult or subadult ringed seal less than
24 hours after being collared (fig. S2B) (29), most 
bears exhibited lower activity rates during the first 
24 hours after capture (fig. S3) (19). Hence, the 
FMRs and activity budgets presented in this study 
should be considered conservative levels for free- 
ranging polar bears. 
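The linear regressions of daily FMR on movement rate and on video-derived activity rate reported earlier can be evaluated directly to see how strongly movement drives energy expenditure. The sketch below is an illustration only. It assumes that the regressions return mass-specific daily FMR in kJ kg⁻¹ day⁻¹, that movement rate is in km h⁻¹, and that activity is expressed as a proportion of time (0 to 1); none of these units is stated next to the equations in the text, and the example rates are hypothetical.

```python
def daily_fmr_from_movement(rate_km_per_h: float) -> float:
    """Regression from the text: daily FMR = 167.3 * rate + 153.0."""
    return 167.3 * rate_km_per_h + 153.0


def daily_fmr_from_activity(activity_proportion: float) -> float:
    """Regression from the text: daily FMR = 336.73 * activity + 180.5."""
    return 336.73 * activity_proportion + 180.5


# Hypothetical slow and fast movers on either side of the 1.0 km/h split.
slow = daily_fmr_from_movement(0.5)
fast = daily_fmr_from_movement(1.5)

print(f"slow mover: {slow:.1f}")
print(f"fast mover: {fast:.1f}")
print(f"fast / slow: {fast / slow:.2f}")
print(f"32% active: {daily_fmr_from_activity(0.32):.1f}")
```

The fold difference between the two hypothetical movers is of the same order as the 1.5-fold difference reported between bears moving faster and slower than 1.0 km h⁻¹. Because each fit rests on nine bears, extrapolating beyond the observed range would be unsafe.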

We found that polar bears in the spring ex- 
hibit greater energetic demands than those of 
previous predictions (13, 14) both for maintenance
functions and locomotion. Similar to other
marine mammals, polar bears likely transitioned
to the marine ecosystem to take advantage of 
abundant prey resources (7) despite the increased 
energetic costs required for a marine and car- 
nivorous existence (37). These demands neces- 
sitate access to high-energy-content prey in the 
form of ringed and bearded seals (Erignathus 
barbatus). 

More than half of the bears in this study 
lost body mass, meaning that over the period 
of observation, their energy demand exceeded 
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available for bears five to nine (19). Bar heights represent data for individual bears. (C) Image of a polar bear eating a recently killed ringed seal. 
(D) Prey type consumed by female polar bears. Bar heights represent data for individual bears. 


that gained by consuming prey. Although we 
cannot assess the effects of post-capture recov- 
ery on our observed foraging rates, previous 
researchers reported that 42% of adult fe- 
male polar bears in the Beaufort Sea during 
the spring from 2000 to 2016 had not eaten 
for ≥7 days before capture (38). This rate of 
fasting was 12% greater than measurements 
from 1983 to 1999 (38), suggesting that spring 
ice conditions are affecting prey availability 
for polar bears even before the summer open 
water period. Additionally, access to optimal 
habitats (annual ice over the continental shelf) 
is expected to decline, and in some areas already 
has declined, as a result of climate change (39, 40). 
Survival rates of cubs, body condition of adult 
females, body size of young, litter mass, and 
yearling numbers have also exhibited declines 
in some regions of the Arctic (41, 42). Together 
with our data on the cost of activity and en- 
ergy acquisition (Figs. 3 and 4), these studies 
suggest that an increasing proportion of bears 
are unable to meet their energy demands. Our 
results indicate that further increases in ac- 
tivity and movement resulting from declin- 
ing and increasingly fragmented sea ice are 
likely to increase the demand side of the energy 
balance ratio (43). Inherently high energy de- 
mands create a physiological constraint that 
makes it difficult for polar bears to compen- 
sate for both increases in activity and declines 
in the availability of energy-dense prey as hab- 
itats become more fragmented (44). Hence, 
increases in movement and activity rates me- 
diated by the loss of sea ice habitat are likely 
to have negative cascading effects on polar 
bear reproductive success and, ultimately, their 
populations. 
Polarity compensation mechanisms 
on the perovskite surface KTaO3(001) 


Martin Setvin,* Michele Reticcioli, Flora Poelzleitner, Jan Hulva, Michael Schmid, 
Lynn A. Boatner, Cesare Franchini, Ulrike Diebold 


The stacking of alternating charged planes in ionic crystals creates a diverging electrostatic 
energy—a “polar catastrophe”—that must be compensated at the surface. We used scanning 
probe microscopies and density functional theory to study compensation mechanisms 
at the perovskite potassium tantalate (KTaO3) (001) surface as increasing degrees of freedom 
were enabled. The as-cleaved surface in vacuum is frozen in place but immediately responds 
with an insulator-to-metal transition and possibly ferroelectric lattice distortions. Annealing in 
vacuum allows the formation of isolated oxygen vacancies, followed by a complete 
rearrangement of the top layers into an ordered pattern of KO and TaO2 stripes. The optimal 
solution is found after exposure to water vapor through the formation of a hydroxylated 
overlayer with ideal geometry and charge. 


All ionic crystals can be considered as stacks 
of alternatingly charged lattice planes along 
certain crystallographic directions. When a 
crystal is terminated perpendicular to such 
a “polar” direction, the electrostatic energy 
diverges (1, 2). The resulting instability can be 
compensated in various ways—for example, by 
charge transfer, structural reconstructions, changes 
in the surface morphology, or chemical doping 
(3-5). Such compensation mechanisms often 
create unusual physical and chemical materials 
properties (6, 7). 

Surface and interface polarity has been dis- 
cussed controversially, particularly for perovskite 
oxides. These materials have a chemical formula 
ABO3, where the larger A cation is 12-coordinate 
and the smaller B cation is 6-coordinate. Along 
the [001] direction, they contain AO and BO2 
planes that in many cases carry formal positive 
and negative charges. Tantalates and niobates 
(B is Ta or Nb) are prototypical examples of 
polar perovskites. Many exhibit (incipient) ferro- 
electricity (8), providing attractive options for 
electronics and sensors. The built-in field can 
also enhance electron-hole separation in light- 
harvesting schemes (9); indeed, these materials 
are highly efficient photocatalysts (10) with a record 
quantum efficiency for photochemical water 
splitting of >50% (11). 

Because surfaces play a major role in all these 
applications, it is important to understand which 
response the system “selects” to relieve the electrostatic 
instability. We studied the polar (001) 
surface of KTaO3 as a representative case using 
atomically resolved microscopy and spectroscopy, 
integral surface analysis techniques, and density 
functional theory (DFT). We show that KTaO3(001) 
undergoes an entire series of routes to polarity 
compensation. 

A scanning tunneling microscopy (STM) image 
of the surface of a KTaO3(001) single crystal after 
being cleaved in ultrahigh vacuum (UHV) shows 
three steps running horizontally along the [100] 
direction (Fig. 1A). These steps have a height of 
~0.4 nm, i.e., the KTaO3 lattice constant. The 
entire area is covered by alternating KO and TaO2 
terraces separated by half-unit-cell steps. Interestingly, 
these terraces run uninterrupted across 
the full-height steps. The KO planes are always on 
top, whereas TaO2 is in the valley region (see the 
sketch in Fig. 1D); apparently, the KO plane fractured 
more easily during cleavage. The KO and 
TaO2 terminations have a formal charge of -1e 
and +1e per unit cell, respectively. The characteristic 
terrace size is 3 to 8 nm, and their ratio is 1:1. 
Thus, the top layer is polarity-compensated on 
a long-range scale (2). 

Although the electronic structure appeared 
delocalized with no measurable atomic corruga- 
tion in STM, noncontact atomic force microscopy 
(nc-AFM) readily showed at atomic resolution a 
perfect (1x1) bulk-terminated structure with a low 
defect concentration. In a constant-height AFM 
image (Fig. 1B), the darker regions with atomic 
resolution correspond to the higher-lying KO 
terraces, and bright, uniform regions are lower-lying 
TaO2 terraces. All of the AFM images were 
taken in constant-height mode, and on TaO2 terraces 
the attractive forces were smaller because 
of the larger tip-surface distance. The AFM imaging 
mechanism on the KO terraces appears 
analogous to the prototypical NaCl(001) surface 
(12), which has an identical arrangement of surface 
ions. [Imaging the TaO2 on the lower terraces 
proved more difficult; see fig. S1 (13).] 

Scanning tunneling spectroscopy (STS) (Fig. 1C) 
showed that the two terminations had distinctly 
different electronic structures. The KO terraces 
had a wide band gap spanning from -3.5 to +1 V, 
whereas TaO2 terraces appeared metallic (red/black 
curves in Fig. 1C), with a shallow state at 
-0.2 eV and several deeper-lying states. These in-gap 
states did not appear on the smaller TaO2 
terraces (below ~8 unit cells in size), and were also 
absent in the border area (~2 unit cells wide) 
around the KO islands (see the green spectrum 
in Fig. 1C). Figure 1E shows a representative spatial 
distribution of the in-gap states (more details are 
in figs. S2 and S3). These states exhibited a char- 
acteristic wavelike pattern with ~1.6 nm perio- 
dicity, indicative of their delocalized (bandlike) 
character (14). The derived wave vector is con- 
sistent with reported photoemission data (15). 

The electric charge in the in-gap states counteracted 
the electrostatic potential arising from the 
TaO2⁺ polarity, but these states formed only when 
the terrace width (and related electrostatic po- 
tential) exceeded a certain critical limit. The sit- 
uation bears similarities to the formation of the 
two-dimensional electron gas (2DEG) at polar 
interfaces, where a critical film thickness is neces- 
sary (6, 16). These in-gap states already form on 
as-cleaved surfaces, which do not show any de- 
fects, and DFT calculations indicate a metalliza- 
tion of the surface (figs. S4 and S5). 

The uncompensated electrostatic potential is 
a driving force for further polarity-compensation 
mechanisms, and we estimated its magnitude 
using Kelvin probe force microscopy (KPFM) (see 
Fig. 1F and fig. S6). On as-cleaved surfaces, the 
local contact potential varied by ~0.3 V. Defect-free 
surfaces were only achieved for samples cleaved 
below room temperature. Annealing the surface 
in vacuum resulted in the development of more 
efficient polarity-compensation mechanisms. After 
raising the sample temperature to 312 K, defects 
formed on the KO terraces (Fig. 2, A and B). AFM 
images of these defects agreed well with Cl va- 
cancies observed on the prototypical NaCl(001) 
surface (12) (see fig. S7). Accordingly, we attributed 
the defects here to O vacancies, the correspond- 
ing formal charge of +2e being suitable for com- 
pensating the negative charge of the KO regions. 
The vacancies appeared in central regions of larger 
terraces—i.e., at the locations with the highest 
electrostatic potential. The O vacancies were likely 
created by the migration of O atoms from the KO 
terraces toward the interface with TaO2. 

Annealing at higher temperature (T = 387 K in 
Fig. 2C) resulted in small “holes” in the central 
regions of the KO terraces. The steps aligned 
more along <110>, the nonpolar step direction 
of KO (see the structural model in Fig. 2G). Upon 
further annealing to 482 and 517 K, the holes in- 
terconnected (Fig. 2, D and E, respectively), and 
formed a labyrinth-like structure. At the atomic 
scale, the surface still had the original bulk (1x1) 
termination, and KO and TaO2 terraces alternated 
in a quasi-periodic pattern separated by 
half-unit steps. The ratio of KO:TaO2 remained 
1:1, indicating that the process is only caused by KO 
rearrangement without desorption or diffusion 
from or to the bulk. KPFM measurements (fig. 
S6) showed that the O-vacancy formation reduced 
the original electrostatic potential by half, and 
after the surface rearranged into the labyrinth 
structure, the measured potential variations were 
negligible. 


The labyrinth structures had a characteristic 
terrace width of ~1.5 nm (4 to 5 atoms), which 
did not change upon further annealing. The 
pattern was remarkably stable up to 700 K. At 
even higher temperatures, the ordering was lost 
(fig. S8), likely because of K segregation from the 
bulk (17). The “magic” size of the stripes in Fig. 2, 
D to F, likely originated from a competition be- 
tween two driving forces: Minimizing the terrace 
size suppressed the electrostatic potential arising 
from the polarity, whereas steps cost energy. This 
hypothesis was tested by calculating the surface 
energy and electrostatic potential as a function of 
terrace width. The structural model consisted of 
alternating KO and TaO2 terraces of equal widths 
w (where w equals the number of K atoms in the 
2D unit cell), separated by steps running in the 
<110> direction (Fig. 2G). The energy minimum 
was achieved for a w of 4 to 5 atoms (red curve in 
Fig. 2H), in agreement with the experiment. The 
electrostatic potential changed smoothly in the 
range 1 < w < 8 and then started to deviate 
rapidly for higher w, indicating increasing po- 
lar instability. 

Considering these calculations, the exact polarity- 
compensation mechanism of the as-cleaved KO 
terraces remains an open question. The low de- 
fect density and the absence of an electronic re- 
construction should result in a relatively high 
electrostatic potential (>1 V), yet the experimental 
KPFM data (Fig. 1F and fig. S6) showed considerably 
smaller potential corrugations. The 
incipient-ferroelectric character of the material 
could contribute to the polarity compensation, 
allowing for generation of electric fields inside 
the material at no energy cost. Our experimental 
indications are indirect, but the calculations 
showed ferroelectric-like lattice distortions (fig. S9). 
Further, the sample cleaving always resulted in 
a very characteristic layout of the KO/TaO2 terraces 
that crossed full steps without interruption 
(Fig. 1A). Strain-induced (18) spinodal decomposition 
into ferroelectric domains could occur during 
the cleavage, and the resulting terrace layout 
would be a “printout” of these domains (fig. S9). 

The most efficient polarity-compensation mechanism 
developed after exposing the surface to H2O 
vapor (Fig. 3). Water induced a complete restructuring 
of the surface layer, even in UHV and at 
room temperature. After dosing 300 Langmuir 
(L) of water vapor at 325 K, the surface was flat 
and had only full-unit-cell steps (Fig. 3A). More 
detailed AFM (Fig. 3B) showed a homogenous 
phase with a (2x1) symmetry; low-energy electron 
diffraction (fig. S10) confirmed that the entire 
surface had been transformed. 

This transition required high doses of water; 
Fig. 3C shows a surface exposed to 100 L H2O. 
Here, the occasional small island of the original 
bulk-terminated KO was found, interfaced with 
the (2x1) overlayer. The (2x1) termination could 
be switched back to the (1x1); see Fig. 3D. Annealing 
above ~488 K in UHV resulted in water 
desorption and formation of the same labyrinth-like 
structure of alternating KO/TaO2 terraces as 
shown in Fig. 2, confirming reversibility between 
these two surface phases. 
To determine the chemical composition of the 
(2x1) superstructure, low-energy He+ ion scattering 
is an ideal method because it is only sensitive 
to the very top layer of the surface. The red 
curve in Fig. 3E was measured on an as-cleaved 
surface. As expected, O, K, and Ta are present. 
After dosing with water, the Ta signal completely 
disappeared (blue curve). X-ray photoelectron 
spectroscopy (fig. S11) revealed the hydroxylated 
character of the (2x1) termination and also ex- 
cluded any possible contamination from the water 
exposure. It appears that water dissolved the KO 
islands and redistributed them across the TaO2 
planes. A proposed structural model is shown in 
Fig. 3F. Each KO unit reacted with one H2O molecule, 
resulting in one K(OH)2 unit per (2x1) unit cell. 
This termination was also polarity-compensated, 
with a net charge of -0.5e per (1x1) unit cell (2). A 
(2x1) termination of KTaO3(001) has already 
been reported (19) and investigated theoretically 
(20), but the presence of water had not been 
considered. 

[Fig. 1 panel labels: STS curves for KO and TaO2 (terrace center); in-gap-state map, log(I_t) at constant height, V_s = -0.05 V, 38x38 nm², range 0.1 to 10 pA.] 
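
The formal-charge bookkeeping behind the hydroxylated (2x1) termination can be checked with a few lines of arithmetic. The following is a minimal sketch, assuming formal ionic charges of +1 for K, -1 for OH, and -2 for O; the function and dictionary names are illustrative and do not come from the report.

```python
from fractions import Fraction

# Formal ionic charges in units of e (assumed standard oxidation states).
FORMAL_CHARGE = {"K": 1, "OH": -1, "O": -2}


def charge_per_1x1_cell(composition, n_1x1_cells):
    """Net formal charge per (1x1) surface unit cell.

    composition: mapping species -> count inside the surface supercell.
    n_1x1_cells: number of (1x1) cells spanned by that supercell.
    """
    total = sum(FORMAL_CHARGE[species] * count
                for species, count in composition.items())
    return Fraction(total, n_1x1_cells)


# Bulk-like KO plane: one K and one O per (1x1) cell.
ko_plane = charge_per_1x1_cell({"K": 1, "O": 1}, 1)

# Hydroxylated overlayer: one K(OH)2 unit per (2x1) cell.
hydroxylated = charge_per_1x1_cell({"K": 1, "OH": 2}, 2)

print(ko_plane, hydroxylated)
```

Under these assumptions the overlayer carries -1/2 e per (1x1) cell, half the -1e of a complete KO plane, which is the value quoted in the text.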

The KTaO3(001) surface exhibited a rich spectrum 
of mechanisms for compensating the surface 
polarity, and a proper control of the environment 
can be used to tailor which one is at work. The 
phenomena observed on the as-cleaved surfaces— 
i.e., electronic reconstruction and ferroelectric 
Fig. 1. Cleaved KTaO3(001). (A) Overview STM image of a surface prepared by cleaving at T = 261 K. 
(B) Atomically resolved AFM image. (C) STS spectra measured above the KO and TaO2 terraces. 
(D) Atomic structure of the TaO2 and KO terraces and a schematic drawing of the cleaving. (E) Spatial 
distribution of in-gap states, measured in the region marked by a dashed square in (A). Dashed lines 
mark steps. (F) KPFM image of a different region, showing the local electrostatic surface potential. 
Fig. 2. Progression of surface structure with increasing annealing temperature. (A to E) AFM images of the surface after annealing in vacuum to 312, 341, 
387, 482, and 517 K, respectively (measured at 4.8 K). The inset in (B) shows a larger-area STM image of the region. (F) Overview STM image of the surface 
shown in (E); the corresponding area is marked by a dashed square. (G) Top view of the model used for the DFT calculations. The unit cell is marked by a rectangle. 
(H) Calculated surface energy and electrostatic potential in the center of the TaO2 terrace as functions of the terrace width. 
Fig. 3. Water-induced, hydroxylated (2x1) surface layer. (A) Overview 
AFM image of the KTaO3(001) surface after exposure to 300 Langmuir 
(L) H2O at T = 325 K. (B) Atomically resolved detail, showing a 
homogeneous superstructure with (2x1) ordering. (C) Surface exposed to 
100 L H2O at T = 298 K, showing the interface between an original 
(1x1) KO terrace and the (2x1) phase. (D) The (2x1) surface after 
annealing to 488 K (inset shows an atomically resolved AFM image). 
(E) Low-energy He+ ion scattering spectra of the as-cleaved, bulk-terminated 
surface and the (2x1) overlayer. (F) Atomic model of the (2x1) 
overlayer. (G) Phase diagram for the transition between the (1x1) bulk 
termination and the (2x1) structure. 
[Panel G horizontal axis: ΔμH2O (eV), from -2.5 to 0.] 
lattice distortions—are well known from perovskite 
heterostructures. We could directly probe 
the character of the 2DEG-like states and their 
spatial confinement to the highly polar regions. 
The labyrinth-like pattern formed upon anneal- 
ing represents an interesting template for charge- 
carrier separation in light-harvesting devices, but 
protection against environmental influences would 
need to be facilitated for practical use. Although 
such a structure minimizes the surface energy, 
the corresponding energy gain is substantially 
lower compared with chemically induced polarity 
compensation by water. The phase diagram in Fig. 
3G shows that the mechanisms encountered in 
UHV decrease the surface energy by tenths of 
electron volts, whereas hydroxylation brings al- 
most one electron volt (at conditions correspond- 
ing to ambient water pressures). Perovskite surfaces 
in ambient conditions are typically hydroxylated 
and near the bulk termination, but little is 
known about the exact surface structure. The 
(2x1) reconstruction observed here could therefore 
serve as a working model for future studies 
into the surface chemistry of perovskite oxides. 
SUPERCONDUCTIVITY 

Probing optically silent superfluid stripes in cuprates 

S. Rajasekaran,* J. Okamoto, L. Mathey, M. Fechner, V. Thampy, G. D. Gu, A. Cavalleri* 


Unconventional superconductivity in the cuprates coexists with other types of 
electronic order. However, some of these orders are invisible to most experimental probes 
because of their symmetry. For example, the possible existence of superfluid stripes is 
not easily validated with linear optics, because the stripe alignment causes interlayer 
superconducting tunneling to vanish on average. Here we show that this frustration 
is removed in the nonlinear optical response. A giant terahertz third harmonic, 
characteristic of nonlinear Josephson tunneling, is observed in La$_{1.885}$Ba$_{0.115}$CuO$_4$ above 
the transition temperature $T_c$ = 13 kelvin and up to the charge-ordering temperature 
$T_{co}$ = 55 kelvin. We model these results by hypothesizing the presence of a pair density 
wave condensate, in which nonlinear mixing of optically silent tunneling modes drives 
large dipole-carrying supercurrents. 


Single-layer cuprates of the type La$_{2-x-y}$(Ba,Sr)$_x$(Nd,Eu)$_y$CuO$_4$ 
exhibit an anomalous suppression of the superconducting 
transition temperature $T_c$ for doping levels 
near 12.5% (1). Several studies have shown 
that this suppression coincides with the formation 
of “stripes,” one-dimensional chains of 
charge rivers separated by regions of oppositely 
phased antiferromagnetism (2, 3). A schematic 
phase diagram of La$_{2-x}$Ba$_x$CuO$_4$, adapted from 
(1), is shown in Fig. 1, along with a sketch of the 
stripe geometry. 

Recently, high-mobility in-plane transport was 
found in some of these striped phases at temperatures 
above the bulk superconducting $T_c$ (4). 
The existence of a striped superfluid state with 
a spatially oscillating superconducting order pa- 
rameter, a so-called pair density wave (PDW) state, 
was hypothesized (5) to explain the anomalously 
low in-plane resistivity. 

Superfluid stripes are difficult to detect. Scanning 
tunneling microscopy (STM) experiments have 
reported spatial modulations in the superconducting 
condensate strength of Bi$_2$Sr$_2$CaCu$_2$O$_{8+x}$ (6). 
However, STM is not sensitive to the phase of 
the order parameter, and these low-temperature 
measurements did not clarify the broader question 
of whether finite momentum condensation 
may be taking place above $T_c$. Also, according to 
the pair density wave model, superfluid trans- 
port perpendicular to the planes is frustrated 
owing to the stripe alignment (Fig. 1, inset). Hence, 
these stripes are invisible in linear c-axis optical 
measurements (7). 
In this report, we show that superfluid stripes 
display characteristic signatures in the nonlinear 
terahertz frequency response. In La$_{1.885}$Ba$_{0.115}$CuO$_4$, 
we detect superfluid stripes above $T_c = 13$ K and 
up to $T_{co} = 55$ K. 

Figure 2 displays the linear and nonlinear terahertz 
reflectivity spectra for two La$_{2-x}$Ba$_x$CuO$_4$ 
crystals (x = 9.5 and 15.5%), measured with single-cycle 
pulses (8) polarized along the c axis [see 
section S1 of (9)] and covering the spectral range 
between 150 GHz and 2.5 THz [see section S2 of 
(9)]. At these doping levels, the material exhibits 
homogeneous superconductivity and only weak 
stripe order, with transition temperatures of 
$T_{co} \approx T_c = 34$ K (x = 9.5%) and $T_{co} \approx 40$ K $> T_c = 32$ K 
(x = 15.5%). 

The low-temperature linear reflectivities 
($T = 5$ K $< T_c$) show Josephson plasma edges at $\omega_{JP0} = 
500$ GHz and $\omega_{JP0} = 1.4$ THz for x = 9.5 and 15.5%, 
respectively (see dashed curves in Fig. 2, A and 
C). The reflectivity edges shifted to lower frequencies 
with increasing temperature, indicating 
a decrease in superfluid density [Fig. 2, B and 
D, and section S2 of (9)]. 

[Fig. 1 vertical axis: Temperature (K)] 
Fig. 1. Phase diagram for La$_{2-x}$Ba$_x$CuO$_4$. 
SC, SO, and CO denote the bulk superconducting, 
spin- and charge-ordered (striped), and charge-only-ordered 
phases, respectively. $T_c$, $T_{so}$, and $T_{co}$ 
are the corresponding ordering temperatures. 
$T_{LT}$ denotes the orthorhombic-to-tetragonal 
structural transition temperature. The samples examined in 
this study are x = 9.5, 11.5, and 15.5% 
(dotted lines). Further, a schematic stripe-ordered 
state is shown wherein the tan 
stripes depict the charge rivers and the 
gray stripes depict the antiferromagnetic 

Nonlinear reflectivities, measured at field 
strengths between 20 and 80 kV/cm (see Fig. 2, 
A and C), displayed two characteristic effects of 
nonlinear interlayer Josephson coupling: (i) a field-dependent 
red shift of the plasma edge (10-12) and 
(ii) a reflectivity peak at the third harmonic of the 
pump field, $\omega \approx 3\omega_{\mathrm{pump}}$ (13). The third-harmonic 
field amplitude $E(3\omega_{\mathrm{pump}})$ scaled with the cube 
of the incident field strength $E(\omega_{\mathrm{pump}})$ (Fig. 2E), 
decreased in strength with temperature, and 
disappeared for $T = T_c$, tracking the superfluid 
density $\omega\sigma_2(\omega \to 0)$ (see Fig. 2, C, D, and F). 
These observations are well understood by 
a semiquantitative analysis of the interlayer 
phase dynamics of a homogeneous layered superconductor 
(10, 12, 14). For a c-axis electric field 
$E(t) = E_0 \sin(\omega_{\mathrm{pump}} t)$, the interlayer phase difference 
$\theta(t)$ advances in time, $t$, according to 
the second Josephson equation $\partial\theta(t)/\partial t = 2edE(t)/\hbar$, 
where $d$ is the interlayer spacing (~1 nm), $2e$ is 
the Cooper pair charge, and $\hbar$ is Planck’s constant 
$h$ divided by $2\pi$ (15). Because the c-axis 
superfluid density $\rho_c$ scales with the order parameter 
phase difference as $\rho_c \propto \cos\theta$, and because 
$\rho_c \propto \omega_J^2$, the plasma frequency renormalizes as 

$$\omega_J^2 = \omega_{JP0}^2 \cos\theta = \omega_{JP0}^2 \cos[\theta_0 \cos(\omega_{\mathrm{pump}} t)] \approx \omega_{JP0}^2 \left(1 - \frac{\theta_0^2}{4} - \frac{\theta_0^2 \cos(2\omega_{\mathrm{pump}} t)}{4}\right),$$

where $\theta_0 = 2edE_0/(\hbar\omega_{\mathrm{pump}})$. An average red shift of the equilibrium 
plasma resonance $\omega_{JP0}$ is hence estimated as 
$\omega_J^2 = \omega_{JP0}^2 \left(1 - \theta_0^2/4\right)$. Secondly, a tunneling supercurrent, 
$I$, is excited at the first and third harmonic 
of the driving field, 

$$I(t) = I_c \sin[\theta_0 \cos(\omega_{\mathrm{pump}} t)] \approx I_c \left[\theta_0 \cos(\omega_{\mathrm{pump}} t) - \frac{\theta_0^3}{6} \cos^3(\omega_{\mathrm{pump}} t)\right],$$

giving rise 
to third harmonic radiation. Finally, because 
the third-harmonic signal is proportional to $I_c$, 
it is expected to follow the same temperature 
dependence as the superfluid density. 
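
The harmonic content of the tunneling supercurrent can be checked numerically. The sketch below is illustrative only and is not the authors' simulation: it samples I(t)/I_c = sin[θ0 cos(ω_pump t)] over one pump period, reads the first- and third-harmonic amplitudes from an FFT, and estimates how the third harmonic scales with θ0 (which is proportional to the field amplitude E0). The function names and the values of θ0 are assumptions chosen for the small-angle regime.

```python
import numpy as np


def harmonic_amplitudes(theta0, n_samples=4096):
    """First- and third-harmonic amplitudes of sin(theta0*cos(wt)), in units of I_c."""
    # Sample exactly one pump period so that FFT bin k is the k-th harmonic.
    phase = 2.0 * np.pi * np.arange(n_samples) / n_samples
    current = np.sin(theta0 * np.cos(phase))
    spectrum = np.fft.rfft(current) / n_samples
    amplitudes = 2.0 * np.abs(spectrum)
    return amplitudes[1], amplitudes[3]


def scaling_exponent(theta_small=0.05, theta_large=0.10):
    """Log-log slope of the third-harmonic amplitude versus theta0."""
    _, a3_small = harmonic_amplitudes(theta_small)
    _, a3_large = harmonic_amplitudes(theta_large)
    return np.log(a3_large / a3_small) / np.log(theta_large / theta_small)


first, third = harmonic_amplitudes(0.1)
print(first, third, scaling_exponent())
```

For small θ0 the third-harmonic amplitude approaches θ0³/24, the value obtained by expanding cos³ in the expression above, and the fitted exponent is close to 3, in line with the cubic field dependence reported in Fig. 2E.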

More comprehensive numerical simulations, 
based on the space (x, not to be confused with the 
symbol for doping concentration)- and time 
(t)-dependent one-dimensional sine-Gordon 
equation for the Josephson phase $\theta(x, t)$ [see 
section S3 of (9)] (12, 14, 16, 17), were used to 
obtain the electromagnetic field at the surface of 
the superconductor and to calculate the reflectiv- 
ity for arbitrary field strengths. These simulations, 
reported in Fig. 3, reproduce the experimental 
data closely. 


[Fig. 1 horizontal axis: Hole doping (%)] 

insulating region (inset). Figure adapted with permission from (1). 
Fig. 2. Third harmonic from homogeneous superconductor. (A) Linear 
and nonlinear reflectivity of La$_{2-x}$Ba$_x$CuO$_4$, where x = 9.5%, measured at 
$T = 5$ K with $\omega_{\mathrm{pump}} = 490$ GHz. The linear reflectivity displays 
a Josephson plasma edge at $\omega_{JP0} = 500$ GHz (0.5 THz), 
whereas the nonlinear reflectivity shows a red shift 
of the edge and third-harmonic generation (red 
shading). (B) Temperature dependence of nonlinear 
reflectivity for x = 9.5%. The third-harmonic peak 
disappears above $T_c = 34$ K. 
(C) Linear and nonlinear reflectivity for x = 15.5%, 
measured with $\omega_{\mathrm{pump}} = 700$ GHz. (D) Temperature 
dependence of third-harmonic generation in x = 15.5%. 
Third-harmonic generation (red shading) disappears 
above $T_c = 32$ K. 
[Fig. 2 vertical axis: Reflectivity] 
Fig. 3. Simulated nonlinear reflectivity 
for homogeneous superconductivity. 
Simulations at x = 9.5 and 15.5% doping. 
(A) Simulated space (x, not to be 
confused with the symbol for doping 
concentration)- and time (t)-dependent 
order parameter phase [$\theta(x, t)$] obtained by 
numerically solving the sine-Gordon 
equation on x = 9.5% samples [see section 
S3 of (9)]. The equation makes use of 
equilibrium superfluid density extracted from 
the linear optical properties and assumes 
excitation with terahertz pulses of shape 
and strength used in the experiment. 
The horizontal dotted lines indicate the 
spatial coordinate x at which the line 
cuts are displayed (lower panel). 
(B) Simulated order parameter phase 
(A) after frequency filtering centered at 
$3\omega_{\mathrm{pump}}$ with its corresponding line 
cut (lower panel). (C) Simulated reflectivity 
in the linear (E = 0.1 kV/cm) and the 
nonlinear (E = 80 kV/cm) regime. The 
third-harmonic generation component is 
highlighted (red shading). (D to F) Same 
as in (A) to (C) but for x = 15.5%. 


Rajasekaran et al., Science 359, 575-579 (2018) 


[Fig. 2 legend: solid line, nonlinear; dashed line, linear. Horizontal axis: Frequency (THz).] 

(E) Third-harmonic electric field strength (normalized to the highest signal) 
plotted as a function of the incident electric field strength [definition explained 
in section S1 of (9)] measured at T = 5 K from the x = 9.5% sample. The 
third-harmonic field displays a cubic dependence on the incident field strength. 
(F) Temperature dependence of the third-harmonic amplitude (normalized 
to the measurement at T = 5 K) from the x = 9.5 and 15.5% doping. The 
superfluid density [$\omega\sigma_2(\omega \to 0)$] (normalized to the measurement at T = 5 K) 
extracted from the linear optical properties of the x = 9.5% sample is also 
shown. All of the quantities vanish above $T_c$. Red shading indicates the 
Fig. 4. Third-harmonic generation in the striped phase. (A) Nonlinear 
frequency-dependent reflectivity measured in the striped x = 11.5% 
samples recorded for three different field strengths at $T = 5$ K ($< T_c = 13$ K). 
(B) Electric field dependence of the third-harmonic amplitude for T = 5 K. 
(C) Temperature dependence of the nonlinear reflectivity for $T > T_c = 13$ K. 
(D) Temperature dependence of the third-harmonic signal (normalized to 
the highest field measurements at T = 5 K). (E) Schematic of the order 
parameter phase in a pair density wave condensate. The black arrows 
represent the superconducting order parameter phase at each lattice point. 
Interlayer tunneling from perpendicularly aligned superfluid stripes is 
equivalent to a checkerboard lattice of alternating $\pi/2$ and $-\pi/2$ phase 
Josephson junctions. Such a lattice has tunneling currents of $I_c$ and $-I_c$ 
flowing at the neighboring junctions at equilibrium (thick red lines). 
(F) Excitation modes of the PDW indicating the $\varphi_0$ and $\varphi_\pi$ modes (see text). 
The shaded region under the black arrow represents the phase excursion 
from the equilibrium geometry ($\delta\theta_0$ and $\delta\theta_\pi$). Corresponding current 
fluctuations $\delta I_0$ and $\delta I_\pi$ produced by such excitations are also depicted (thin 
red lines). (G) Calculated nonlinear current response for the unit cell of 
(E) after application of a single-cycle optical pulse centered at 500 GHz 
frequency [see section S5 of (9) for details]. 


We next turn to the key results of this report, 
which provide evidence for superfluidity in the 
normal-state striped phase in La$_{1.885}$Ba$_{0.115}$CuO$_4$ 
(x = 11.5%). In this compound, striped charge order 
coexists with superconductivity below $T_c = 13$ K and 
extends into the normal state up to $T_{co} = 55$ K (1). 

Figure 4A displays the results for $T < T_c = 
13$ K. Note that in this material, the equilibrium 
Josephson plasma resonance was at lower frequency 
than in the other two compounds with 
higher $T_c$ (at ~150 GHz) and could not be observed 
[see section S2 of (9)]. However, a third 
harmonic at $\omega = 3\omega_{\mathrm{pump}} \approx 1.4$ THz was clearly 
observed. Notably, the third-harmonic signal remained 
finite also for $T > T_c = 13$ K and up to 
$T \approx T_{co} = 55$ K (Fig. 4, C and D). A giant third 
harmonic, amounting to several percent of the 
driving field, can be understood only in the presence 
of superconducting tunneling above $T_c$ and 
up to $T_{co}$. 

In the following paragraphs, we show that a 
pair density wave, which does not show features 
of superfluidity in the linear optical properties, 
retains the large nonlinear optical signal of the 
homogeneous condensate. As shown in Fig. 4E, 
the order parameter phase of the PDW, encoded 
by the vector angle (black arrow), changes between 
neighboring stripes and rotates by 90° 
from one plane to the next (8), resulting in a 
checkerboard lattice of $\pi/2$ and $-\pi/2$ Josephson 
junctions. Hence, the equilibrium PDW supports 
a lattice of staggered tunneling supercurrents, 
which average out to zero (thick red arrows in 
Fig. 4E). 

The interlayer phase fluctuations probed in the 
optical response are described by two normal modes, 
termed here $\varphi_0$ and $\varphi_\pi$. The $\varphi_0$ mode is optically 
active. For an optical field $E(t) = E_0 \sin(\omega_{\mathrm{pump}} t)$, 
one has identical phase excursions 
$\delta\theta_0(t) = \frac{2edE_0}{\hbar\omega_{\mathrm{pump}}} \cos(\omega_{\mathrm{pump}} t)$ 
at each site (shading under 
black arrows in Fig. 4F). However, current fluctuations 
$\delta I_0(t)$ of equal magnitude, but opposite 
sign, at neighboring $\pi/2$ and $-\pi/2$ junctions make 
this mode silent (red arrows in Fig. 4F). The second 
mode, $\varphi_\pi$, consists of phase excursions $\delta\theta_\pi(t)$ 
occurring in opposite directions at neighboring 
sites and is optically inactive (see Fig. 4F). 


2 February 2018 


In the nonlinear regime, the optical response 
of the PDW is no longer zero. Because $\varphi_0$ is odd 
and $\varphi_\pi$ is even, a nonlinear expansion of the 
Josephson energy ($U$) can be written as 
$U(\varphi_0, \varphi_\pi) \approx J'\varphi_\pi^2 - J''\varphi_\pi + C\varphi_0^2\varphi_\pi$, 
where $J'$ and $J''$ are the in-plane 
and out-of-plane Josephson energies and 
$C = J''/2$, with $J' > J''$, where $C$ is the coupling 
constant between the two normal modes [see 
section S5 of (9)]. 

A large third-harmonic signal is readily predicted
from the $\varphi_0^2 \varphi_\pi$ coupling term. For an optical
field of the type $E(t) = E_0 \sin(\omega_{\mathrm{pump}} t)$, which
acts only on the mode $\varphi_0$, the equations of motion
of the phase are $\ddot{\varphi}_0 \propto \omega_{\mathrm{pump}} E_0 \cos(\omega_{\mathrm{pump}} t)$
and $\ddot{\varphi}_\pi \propto -2C\varphi_0^2$. These coupled equations imply
not only phase oscillations in the $\varphi_0$ mode at
$\omega_{\mathrm{pump}}$, i.e., $\delta\theta_0(\omega_{\mathrm{pump}})$, but also indirect excitation
of the optically inactive $\varphi_\pi$ mode. Because
in the equation of motion for $\varphi_\pi$ the driving
force is proportional to $\varphi_0^2$, the phase of this
mode $\delta\theta_\pi$ is driven at $2\omega_{\mathrm{pump}}$. Because the total
nonlinear current [$I_{\mathrm{tot}} = \sum I_{\mathrm{junctions}}$] contains
terms of the type $I \sim 2J''\varphi_0\varphi_\pi$, the two phase
coordinates $\varphi_0(\omega_{\mathrm{pump}})$ and $\varphi_\pi(2\omega_{\mathrm{pump}})$ are mixed
and produce current oscillations at the difference
and sum frequencies $\omega_{\mathrm{pump}}$ and $3\omega_{\mathrm{pump}}$. A numerical
solution of the two equations of motion
with realistic parameters for the Josephson
coupling energies $J_0$ and $J''$ displays oscillatory
currents at the fundamental and third harmonic
[Fig. 4G, also see section S5 of (9) for details on
the simulations].
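
The frequency-mixing step of this argument can be checked numerically. The following is a minimal sketch and not the simulation of section S5 of (9): it assumes unit-amplitude phases oscillating at the pump frequency and at twice the pump frequency (arbitrary units), multiplies them as in the mixed current term, and projects the product onto the first few harmonics. The helper name `harmonic_amplitude` and the sample count are illustrative choices.

```python
import math

def harmonic_amplitude(samples, harmonic):
    """Amplitude of the component at `harmonic` cycles per record (direct DFT projection)."""
    n = len(samples)
    re = sum(s * math.cos(2 * math.pi * harmonic * k / n) for k, s in enumerate(samples))
    im = sum(s * math.sin(2 * math.pi * harmonic * k / n) for k, s in enumerate(samples))
    return 2 * math.hypot(re, im) / n

N = 512  # samples over exactly one pump period (assumed, arbitrary)
phi_0 = [math.cos(2 * math.pi * k / N) for k in range(N)]       # oscillates at omega_pump
phi_pi = [math.cos(2 * math.pi * 2 * k / N) for k in range(N)]  # driven at 2 * omega_pump
current = [a * b for a, b in zip(phi_0, phi_pi)]                # mixed term ~ phi_0 * phi_pi

amplitudes = {h: harmonic_amplitude(current, h) for h in range(1, 5)}
for h, amp in amplitudes.items():
    print(f"{h} x omega_pump: {amp:.3f}")
```

Only the first and third harmonics carry weight, each with amplitude 1/2, as follows from the identity cos(a)cos(b) = [cos(a - b) + cos(a + b)]/2.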

Finally, from the model above, the third- 
harmonic current is predicted to scale linearly 
with the out-of-plane critical current and, hence, 
the superfluid density of the stripes. From the 
plot in Fig. 4D, the local superfluid density at 
spin- and charge-ordering temperature T,, is 
found to be 60% of that measured below $T_c$ and
subsequently decreases continuously as the temperature
is increased further, before vanishing
at $T_{\mathrm{CO}}$.

Although the model discussed above provides 
a plausible description of the experimental ob- 
servations, other hypotheses for the origin of the 
third-harmonic signal for $T > T_c$ should be considered.
The measured and simulated third
harmonic is far larger than, and hence easily 
distinguished from, the effect of noncondensed 
quasiparticles. The nonlinear susceptibility de- 
tected in the present experiments x ~ 10-°m?/V? 
(obtained from the electric field strengths at 
the fundamental and third harmonic) is sev- 
eral orders of magnitude bigger than typical 
cubic nonlinearities (y ~ 1078 - 10°°m?/V?) 
(19-21). Furthermore, first-principles calculations
show that, for this compound, the value
of $\chi^{(3)}$ from quasiparticle transport within
anharmonic bands is at least three orders of magnitude
smaller than what is measured here [see
section S6 of (9)]. 

A second alternative may involve the sliding 
of a charge density wave along the c axis, as dis- 
cussed in (22) for blue bronze. However, the ef- 
ficiency of the sliding of a charge density wave, 
reported in (22) for kilohertz frequency excitation,
is expected to reduce strongly at higher
excitation frequencies and can be ruled out for 
the terahertz irradiation [see section S7 of (9)]. 
The results for the 15.5% sample, where the
third-harmonic signal disappears at $T_c = 32$ K ($< T_{\mathrm{CO}} =
40$ K), further indicate that the third harmonic
results from superconducting tunneling rather 
than charge ordering. 

The observation of a colossal third-harmonic signal
in the stripe-ordered state of $\mathrm{La_{1.885}Ba_{0.115}CuO_4}$
provides compelling experimental evidence for 
finite momentum condensation in the normal 
state of cuprates and underscores the power of 
nonlinear terahertz optics as a sensitive probe 
of frustrated excitations in quantum solids. A 
natural direction for this line of research involves 
the study of other forms of charge order that 
compete or coexist with superconductivity, such 
as those found in $\mathrm{YBa_2Cu_3O_{6+x}}$ (23, 24). One may
also find application of these techniques in other 
regimes of the cuprate pseudogap, with finite 
superfluid density, with vanishing range phase 
correlations (25, 26), or where other forms of 
density waves (27, 28) have been discussed. 

Observation of chiral phonons


Hanyu Zhu, Jun Yi, Ming-Yang Li, Jun Xiao, Lifa Zhang, Chih-Wen Yang,
Robert A. Kaindl, Lain-Jong Li, Yuan Wang, Xiang Zhang


Chirality reveals symmetry breaking of the fundamental interaction of elementary 
particles. In condensed matter, for example, the chirality of electrons governs many 
unconventional transport phenomena such as the quantum Hall effect. Here we show that 
phonons can exhibit intrinsic chirality in monolayer tungsten diselenide. The broken 
inversion symmetry of the lattice lifts the degeneracy of clockwise and counterclockwise 
phonon modes at the corners of the Brillouin zone. We identified the phonons by the 
intervalley transfer of holes through hole-phonon interactions during the indirect infrared 
absorption, and we confirmed their chirality by the infrared circular dichroism arising from 
pseudoangular momentum conservation. The chiral phonons are important for electron-phonon
coupling in solids, phonon-driven topological states, and energy-efficient
information processing.


Chirality is a fundamental property of an
object not identical to its mirror image. For
elementary and quasiparticles, it is an important
quantum concept at the heart of
modern physics. For example, the discovered
handedness of neutrinos in electroweak interac- 
tion revolutionized our understanding of the uni- 
versal parity conservation law. Chiral fermions 
with spin-momentum locking can emerge in solid- 
state systems such as the recently observed Weyl 
semimetals with inversion symmetry-breaking 
lattices (1). Electron chirality in graphene defined 
through pseudospin was found to cause distinc- 
tive transport properties such as unconventional 
Landau quantization and Klein tunneling (2, 3). 
Meanwhile, electrons in monolayer transition 
metal dichalcogenides with broken inversion sym- 
metry display optical helicity, which opens the 
field of valleytronics (4). The outstanding ques- 
tion is whether bosonic collective excitations such 
as phonons can attain chirality. Recently, the 
intrinsic chirality of phonons without applied 
external magnetic fields in an atomic lattice 
was predicted theoretically at the corners of the 
Brillouin zone of an asymmetric two-dimensional 
(2D) hexagonal lattice (5). Because of the three- 
fold rotational symmetry of the crystal and the 
momentum vectors, the vibrational plane wave is 
composed of unidirectional atomic circular ro- 
tation. Because the phonons in the same valley 
have distinct energy levels, the rotation is not 
the superposition of linear modes, in contrast to the 
circular motion in nonchiral cases (6). The hy- 
pothetical chiral phonons are potentially im- 
portant for the control of intervalley scattering 
(7, 8), lattice modulation-driven electronic phase 
transitions (9, 10) and topological states (11), as
well as information carriers that can be robust 
against decoherence for solid-state quantum information
applications (12).

Here we report the observation of intrinsic 
chiral phonons via transient infrared (IR) spectroscopy
in the atomic lattice of monolayer WSe$_2$.
A particle’s angular momentum is calculated 
by the phase change under a rotational trans- 
formation. This definition can be extended for 
quasiparticles in solid lattices with discrete spa- 
tial symmetry, as long as the rotation is in the 
crystalline symmetry group (13). For an inversion 
symmetry-breaking 2D hexagonal lattice such 
as monolayer WSe$_2$ (Fig. 1A), phonons with well-defined
pseudoangular momentum (PAM) are
located at the Γ, K, and K′ points in the reciprocal
space (5). Because the phonons inherit the
threefold rotational (C3) symmetry of the lattice,
their function of motion must be C3 invariant
except for a phase difference (i.e., circular) (14).
The K and K’ phonons are particularly notable 
because there is no reflection line that preserves 
both the lattice and their momentum (fig. S5). 
Therefore, the atomic rotation cannot be reversed 
without changing its momentum or energy. For 
example, if we attempt to reverse the rotational 
direction of the Se atoms in a longitudinal op- 
tical phonon LO(K) while keeping the relative 
phase determined by the q = K momentum, the 
mode changes to a longitudinal acoustic phonon
LA(K) that oscillates at a different frequency.
Thus, unlike the Γ phonon, such a rotation
cannot be decomposed into orthogonal linear
vibrations with a $\pm\pi/2$ phase shift. On the other
hand, to maintain the same mode, the relative 
phase must be reversed, resulting in a phonon at 
the K’ point. 

We deduce the phonon PAM $l$ for each mode
from the phase change after counterclockwise
120° rotation: $\hat{C}_3(u_k) = e^{-i(2\pi/3)l} u_k$, where $u_k$ is the
function of atomic motion (fig. S6). Because LA(K)
is always C3 symmetric, it has $l = 0$; in contrast,
LO(K) gains negative phase after rotation
(becomes ahead of time) and thus $l = 1$.
The K phonons with different PAM are com- 
pletely nondegenerate because of the mass differ- 
ence between W and Se. The lifting of degeneracy 
guarantees that each mode has a distinctive
selection rule for the electron-phonon and op- 
tical scattering (table S1). These features differ- 
entiate the intrinsically chiral phonons from any 
previously investigated phonons where circular 
atomic motion results from superposing linear 
polarized eigenmodes (6, 15, 16). Chirality in the 
absence of an external magnetic flux is also dis- 
tinguished from the magnetic field-induced split 
of degenerate superposition modes (17). Finally, 
both the physical origin and properties of the 
atomic chiral phonons are fundamentally dif- 
ferent from previous phononic crystal-based chi- 
rality from geometric engineering or topological 
edge states (18-20). 
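
The rule used above to read the PAM off the phase acquired under a counterclockwise 120° rotation can be written as a short sketch. It assumes the convention that the rotation multiplies the mode by exp(-i(2π/3)l) and that l is defined modulo 3 and folded into {-1, 0, 1}; the function name `pam_from_rotation_phase` is hypothetical.

```python
import cmath
import math

def pam_from_rotation_phase(phase_factor):
    """Return l in {-1, 0, 1} such that phase_factor = exp(-i * (2*pi/3) * l)."""
    angle = cmath.phase(phase_factor)        # principal value in (-pi, pi]
    l = round(-angle / (2 * math.pi / 3))    # invert exp(-i * (2*pi/3) * l)
    return ((l + 1) % 3) - 1                 # fold into {-1, 0, 1}; l is defined modulo 3

invariant_mode = 1 + 0j                              # unchanged by the rotation
negative_phase = cmath.exp(-1j * 2 * math.pi / 3)    # gains negative phase
positive_phase = cmath.exp(+1j * 2 * math.pi / 3)    # gains positive phase

l_invariant = pam_from_rotation_phase(invariant_mode)
l_negative = pam_from_rotation_phase(negative_phase)
l_positive = pam_from_rotation_phase(positive_phase)
```

Under this convention a C3-invariant mode has l = 0 and a mode that gains negative phase has l = 1, matching the assignments for LA(K) and LO(K) in the text.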

We use an optical pump-probe technique to 
identify the chiral phonon by its scattering with 
holes (Fig. 1B and fig. S2). The linear momentum, 
PAM, and energy of the respective phonon modes 
are determined by characterizing all of the other 
particles involved in the intervalence band transition
(IVBT) process (14). First, we inject holes at
the K valley by a left-circularly polarized (LCP)
optical pump pulse. The K valley-polarized hole
relaxes to the valence band edge with initial linear
momentum $p_i = -K$. It can transit to the K′
point ($p_f = K$) by emitting a K phonon
($q = p_i - p_f = -2K = K$), but the intermediate state is virtual
because the spin-conserving state has much high- 
er energy due to large spin-orbit coupling. We 
then send an IR pulse to satisfy the energy con- 
servation and place the hole in the spin-split band 
at K’. Because the hole states have zero PAM, the 
PAM of the phonon must be equal to the spin 
angular momentum of the absorbed IR photon. 
Therefore, the LCP pulse controls the creation of 
only LO(K) phonons. Eventually, we recognize the 
final spin-split state in the opposite valley by the 
energy and right circular polarization of its radi- 
ative decay. The selection rule of phonon crea- 
tion will be identical if the hole first absorbs an IR 
photon, transits to an intermediate state within 
the valley, and then emits a phonon, as long as the 
final state is the same. The phonon annihilation 
is negligible because the population is very low 
for all phonon modes at the K or K’ points at a 
base temperature of 82 K. The selection is also 
not affected by a small momentum distribution
of the initial and final holes (14) due to
finite thermal energy or the strong many-body 
interaction (21). This is because, like the optical 
valley-selectivity, the contrast of electron-phonon 
coupling strength remains large in the vicinity 
of K and K’, even if the PAM is only defined on 
these points. 
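
The identification of -2K with K in the momentum bookkeeping above holds modulo a reciprocal lattice vector. A minimal sketch of that check follows, assuming the K point is written as (2/3, 1/3) in units of the reciprocal basis vectors (one common convention for a hexagonal lattice); the helper names are hypothetical.

```python
from fractions import Fraction

# K point in units of the reciprocal basis vectors (b1, b2); assumed convention.
K = (Fraction(2, 3), Fraction(1, 3))

def is_reciprocal_lattice_vector(v):
    """True if both fractional components are integers."""
    return all(c.denominator == 1 for c in v)

def equivalent(p, q):
    """Crystal momenta are equivalent if they differ by a reciprocal lattice vector."""
    return is_reciprocal_lattice_vector(tuple(a - b for a, b in zip(p, q)))

minus_K = tuple(-c for c in K)
p_i = minus_K                                  # hole at the K valley: p_i = -K
p_f = K                                        # hole at the K' valley: p_f = K
q = tuple(a - b for a, b in zip(p_i, p_f))     # emitted phonon: q = p_i - p_f = -2K

q_is_K = equivalent(q, K)
q_is_K_prime = equivalent(q, minus_K)
```

Here `q_is_K` is true and `q_is_K_prime` is false, because 3K is a reciprocal lattice vector whereas K itself is not.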

To determine the linear momentum of the 
involved phonon, we measure the linear momen- 
tum of the initial and final holes. Under circu- 
larly polarized pump excitation, the majority of 
holes maintain their linear momentum near the 
original valley, as shown by the positive helicity 
of the photoluminescence. The final linear mo- 
mentum of the spin-split states is read out by the 
polarization of their luminescence at higher en- 
ergy than the original pump photon (Fig. 2A). 
We measure the difference of luminescence with 
the same polarization as the pump light ($I_{\mathrm{same}}$)
Fig. 1. Nondegenerate chiral phonons in monolayer WSe$_2$ and the selection
rule of hole-phonon interactions. (A) The atomic motion of W and Se
atoms (blue and yellow spheres, respectively) in the real space is intrinsically 
circular for the chiral phonons residing at the K point (red dot) in the reciprocal 
lattice (array of black dots), due to the threefold rotational symmetry. Because 
the momentum vector determines the relative phase of the Se motion, 
opposite rotations correspond to completely different modes. The two modes 
have distinct PAM with respect to the center of the hexagon. (B) In the
intervalley optical transition of holes, the PAM of the K phonon is equal to the 
spin of the IR photon due to angular momentum conservation. A hole injected 
by the LCP optical pump moves from the K to the K’ valley through virtual 
scattering with a LO(K) phonon. Owing to the spin and energy mismatch, a real 
phonon is created only if the hole simultaneously absorbs an IR probe photon 
and transits to the spin-split state, which is signaled by the RCP luminescence. 
Fig. 2. Chiral phonon creation from the intervalley transition of holes
measured by polarized luminescence. (A) The spin-allowed intervalley 
transition of holes from the K point creates a K phonon, yet the spin-forbidden 
transition within the same valley does not. The two processes are distinguished 
by the polarized luminescence signal. (B) The observed initial positive 
polarization indicates a spin-flipping transition of nonequilibrium holes, with a 
lifetime of ~0.2 ps. After that, the negative polarization shows a prevailing
phonon-creating intervalley transition for valley-polarized holes, with a lifetime
of ~2 ps at 82 K. The pump and probe energies are tuned to 1.97 and 0.51 eV,
respectively, whereas the signal is integrated from 2.05 to 2.19 eV. (C) The 
spectrum of the polarized luminescence taken at t = 0.8 ps agrees with the 
B-exciton photoluminescence (PL) and confirms its spin-split hole origin. 
The error bars in (B) and (C) indicate SE calculated from multiple measure- 
ments. a.u., arbitrary units. 
Fig. 3. Chirality of phonons measured by transient IR CD. (A) The
phonons participating in the indirect optical transition are different for the 
LCP and RCP IR photons because of the conservation of linear and angular 
momentum. Their distinct electron-phonon scattering strengths dictate 
that the two processes have different amplitudes, leading to polarized
IR absorption. $h_A$(K), hole at the valence band edge of the K valley;
$h_B$(K′), hole in the spin-split band of the K′ valley. (B) The measured CD at
82 K (black squares and red fitting curve) is positive for both the initial 
intravalley spin-flipping transition and the later intervalley phonon-creating 
transition. It not only proves the chirality of phonons but also indicates
that the LO phonon process is dominant. (C) The spectrum of the 
transient CD acquired at t = 0.8 ps shows a photonic energy threshold 
of 0.448 ± 0.002 eV. Its step shape is expected from the 2D excitonic
density of states. Subtracting the energy difference between the initial and
final hole states, a phonon energy of 29 ± 8 meV is deduced, in agreement
with that of the LO mode. The error bars denote SE calculated from 
multiple measurements, and the threshold uncertainty indicates the SE 
of the fitting parameter. 
and the opposite polarization ($I_{\mathrm{oppo}}$) as a function
of the probe delay time t (Fig. 2B). We find
that the polarized luminescence ($I_{\mathrm{same}} - I_{\mathrm{oppo}}$) is
positive for t < 0.2 ps, indicating that the spin- 
flipping intravalley IVBT has a higher probability 
when the IR probe arrives immediately after the 
excitation of holes. This is because spin is not 
strictly conserved for optical transitions away 
from the high-symmetry points. However, the 
process is forbidden at the valley center with 
incident light perpendicular to the lattice plane, 
so its magnitude decays with a lifetime of 0.2 ps 
as the nonequilibrium carriers thermalize to the 
band edge (22, 23). After that, the luminescence 
switches polarization as a result of the dominant 
spin-conserving intervalley IVBT. It shows that 
the majority of final holes have a linear momen- 
tum opposite that of the initial holes. The mo- 
mentum must have been transferred to phonons 
instead of defects, as seen from the comparison 
of the second-order Raman scattering through 
the defect-assisted one-phonon process with that 
of the two-phonon process, which is an order of 
magnitude stronger (fig. S7). Directional inter- 
valley transfer through electron-electron interac- 
tions (24) is excluded because the pump is not in 
resonance with the A exciton, and the signal is 
linearly proportional to the hole density (fig. S8). 
The exchange interaction reduces the polariza- 
tion of the holes but should not create an oppo- 
site polarization (25). Finally, the depolarization 
plus the decay of the hole population yields a 
joint lifetime of ~2 ps for the polarized lumi- 
nescence, in agreement with previous exciton 
studies (26). To further verify the nature of the 
final holes, we measure the polarized lumines- 
cence at different collection energies with t = 
0.8 ps. The emission spectra obtained with different
IR photon energies are all close to the B-exciton
photoluminescence (Fig. 2C and fig. S12),
which proves that the emission is the radiative 
decay of a real state near the edge of the spin- 
split band, rather than a virtual state. Therefore, 
we use the luminescence from 2.05 to 2.19 eV to 
quantify the indirect IR absorption and the pho- 
non creation. 

The PAM of valley phonons is determined 
by measuring the polarization selection of the 
absorbed IR photon, resulting from the chiral 
electron-phonon interaction. With LCP excita- 
tion, the intervalley transition of holes from the 
K valley can either absorb an LCP IR photon to 
produce LO phonons or absorb a right-circularly 
polarized (RCP) IR photon to produce TA/A$_1$
phonons (Fig. 3A). We observe different IR absorption
when the probe has the same polarization
as the pump light ($\alpha_{\mathrm{same}}$) versus when it
has the opposite polarization ($\alpha_{\mathrm{oppo}}$). This IR
circular dichroism (CD = $\alpha_{\mathrm{same}} - \alpha_{\mathrm{oppo}}$) demonstrates
that scattering cross sections of the
two processes are not equal, owing to the dif- 
ference of electron-phonon coupling strength. 
As shown in Fig. 3B, the CD at 82 K is always 
positive for valley-polarized holes. Because the 
holes have zero PAM regardless of valley index 
or spin-split bands, the angular momentum of 
the IR photon is transferred to either the spin
or the phonon. The initial intravalley IVBT pro- 
duces positive CD because the electronic spin 
flips from -1/2 in the spin-split band to +1/2 
at the valence band edge at the K point. How- 
ever, this contribution decays fast as a function 
of IR delay in accordance with Fig. 2B. Thus, 
the intervalley phonon-creating transition is 
the dominant source of positive CD for t = 0.8 ps. 
It indicates that the LO branch contributes most 
to the indirect IR absorption. For nonchiral sys- 
tems, although photons with opposite circular 
polarization may interact with different par- 
ticles, the probabilities are always equal. There- 
fore, the CD is the signature of intrinsic phonon 
chirality. 

The energy of this phonon mode is measured 
according to energy conservation: The energy 
sum of the incoming particles, the IR photon, 
and the initial holes must be equal to that of the 
outgoing particles, the chiral phonon, and the 
final holes (14). We observe that the CD spectrum
at t = 0.8 ps (Fig. 3C) is a step function with a
clear threshold near 0.448 ± 0.002 eV, broadened
by the spectral width of the IR pulse. The
shape corresponds to a transition from a hole at 
the valence edge to a band with parabolic dis- 
persion in two dimensions (fig. S9). This value is 
distinguished from either the intra-excitonic tran- 
sition (27) or the exciton dissociation energy (27). 
Next, we find that the ground-state configuration 
of the initial holes is the dark A trion at $E_i$ = 1.671 ±
0.006 eV (fig. S11) (28) based on the dominant
bright A-trion photoluminescence (fig. S1). We
verify that the dark trions are near thermal
equilibrium because the IR spectrum is very different
from that of nonequilibrium carriers with
excessive energy at t = 0 ps (fig. S10). The final
state is the bright B trion at $E_f$ = 2.090 ± 0.005 eV,
as measured through its emission (fig. S12). Summarizing
these values, we deduce the phonon
energy $E_{\mathrm{phonon}} = E_{\mathrm{photon}} + E_i - E_f$ = 29 ± 8 meV, in
agreement with the first-principles calculation 
and the Raman spectrum for the LO(K) mode 
(figs. S7 and S9). The uncertainty may be reduced 
in the future by improving the uniformity of 
the sample and the spectral resolution of the 
B-exciton emission. Better spectroscopy will also 
potentially reveal details about the chiral phonon- 
exciton coupling—such as the contribution from 
LA phonons and the various pathways of the 
indirect transition—that are not resolvable with 
the current signal-to-noise ratio. 
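
The energy bookkeeping in this paragraph can be reproduced from the quoted values. The sketch below assumes that the three uncertainties are independent and add in quadrature; that assumption is not stated in the text, although it is consistent with the quoted ±8 meV.

```python
import math

# Values quoted in the text, in eV
E_photon, dE_photon = 0.448, 0.002   # IR absorption threshold
E_i, dE_i = 1.671, 0.006             # initial state (dark A trion)
E_f, dE_f = 2.090, 0.005             # final state (bright B trion)

# Energy conservation: photon + initial hole state = phonon + final hole state
E_phonon = E_photon + E_i - E_f

# Assumption: independent uncertainties combined in quadrature
dE_phonon = math.sqrt(dE_photon**2 + dE_i**2 + dE_f**2)

print(f"E_phonon = {E_phonon * 1e3:.0f} +/- {dE_phonon * 1e3:.0f} meV")
```

The dominant contributions to the uncertainty are the two trion energies, which is consistent with the remark that a better-resolved B-exciton emission would tighten the result.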

Our findings of chiral phonons are fundamen- 
tally important for potential experimental tests 
of quantum theories with chiral bosons (29) in 
the solid state. Our work also provides a possi- 
ble route for controlling valley and spin through 
electron-phonon scattering and strong spin- 
phonon interactions (16). Furthermore, the lift- 
ing of degeneracy by chirality offers robust PAM 
information against decoherence and long-range 
perturbation and offers a new degree of freedom 
to the design and implementation of phononic 
circuitry (30) at the atomic scale without mag- 
netic fields. 

CD8+ T cell-dependent killing of cancer cells requires efficient presentation of tumor antigens by human
leukocyte antigen class I (HLA-I) molecules. However, the extent to which patient-specific HLA-I genotype
influences response to anti-PD-1 or anti-CTLA-4 is currently unknown. We determined the HLA-I genotype
of 1,535 advanced cancer patients treated with immune checkpoint blockade (ICB). Maximal
heterozygosity at HLA-I loci (A, B, and C) improved overall survival after ICB compared to patients who
were homozygous for at least one HLA locus. In two independent melanoma cohorts, patients with the
HLA-B44 supertype had extended survival, whereas the HLA-B62 supertype (including HLA-B*15:01) or
somatic LOH at HLA-I was associated with poor outcome. Molecular dynamics simulations of HLA-B*15:01
revealed unique elements that may impair CD8+ T cell recognition of neoantigens. Our results have
important implications for predicting response to ICB and for the design of neoantigen-based therapeutic
vaccines.


Immune checkpoint inhibitors that target cytotoxic T-lym- 
phocyte-associated protein 4 (CTLA-4) or programmed cell 
death protein 1 (PD-1) or its ligand (PD-L1) have markedly
improved the treatment of patients with metastatic cancer 
(1-3). However, tumor responses to these drugs are variable, 
and treatment resistance is common (4-6). To date, most re- 
search to predict clinical efficacy to immune checkpoint 
blockade (ICB) therapies has focused on tumor immune phenotype,
somatic genomic features, or the gut microbiome (7-21),
but how host germline genetics impacts response is unclear.

HLA class I (HLA-I) genotype has been linked with differ- 
ential immune responses to infection, inflammatory condi- 
tions, and autoimmune diseases (22-30). Each HLA-I 
molecule binds specific peptides derived from intracellular 
proteins for presentation on the cell surface to CD8+ T cells
(31-33). The anti-tumor activity of ICB has been shown to depend
on CD8+ T cell, HLA class I-dependent immune activity
(34-36). We performed survival and genetic association anal- 
yses to address two hypotheses: (i) zygosity at HLA-I genes 
influences survival of cancer patients to ICB, and (ii) individual
HLA-I germline alleles influence survival to ICB differently.

We examined two sets of cancer patients (henceforth 
called cohort 1 and cohort 2) treated with ICB. Cohort 1 (n = 
369 patients) was treated with anti-CTLA-4 or anti-PD-1 ther- 
apy, and exome sequencing and clinical data were obtained. 
Within cohort 1, 269 patients had advanced melanoma [pre- 
viously reported (7, 11, 12, 17)], and 100 patients had advanced 
non-small cell lung cancer (NSCLC) (10) (table S1). Patients 
with NSCLC were treated mainly with anti-PD-1 monotherapy.
Cohort 2 (n = 1,166 patients) comprised different cancer
types including melanoma and NSCLC (table S1), and tumors 
were subjected to targeted next-generation sequencing
(MSK-IMPACT) (37). These patients were treated with drugs targeting
CTLA-4, PD-1/PD-L1, or a combination of both, at the
Memorial Sloan Kettering Cancer Center (37). For all patients in
both cohorts, we performed high-resolution HLA class I gen- 
otyping from normal DNA using DNA sequencing data or a 
clinically validated HLA typing assay (LabCorp). 
HLA class I molecules are highly polymorphic with variation
located in the peptide-binding region, as each variant
binds a select repertoire of peptide ligands. As such, an indi- 
vidual homozygous in at least one HLA-I locus would be pre- 
dicted to present a smaller, less diverse repertoire of tumor- 
derived neoantigens to cytotoxic T lymphocytes (CTLs) com- 
pared to a person who is heterozygous at each class I locus 
(32). We therefore asked whether greater diversity (heterozy- 
gosity) in the repertoire of antigen-presenting HLA-I mole- 
cules could be associated with better survival following ICB 
therapy. We examined HLA-I variation at each of the genes 
(HLA-A, -B, and -C) in cohort 1 and cohort 2, by employing a 
Cox proportional hazard regression model to examine overall 
survival probability. HLA-I homozygosity in at least one locus 
was associated with reduced survival in cohort 1 (n = 369; P 
= 0.036, HR = 1.40, 95% CI 1.02 - 1.9) (Fig. 1A), and was vali- 
dated in the independent cohort of 1,166 patients (cohort 2; 
P = 0.028, HR = 1.31, 95% CI 1.03 - 1.70) (Fig. 1B). The number 
of somatic mutations in tumors was not statistically different 
between homozygous and heterozygous patients (fig. S1, A 
and B). Furthermore, the association of HLA-I homozygosity 
with reduced survival remained significant in multivariable 
Cox regression modeling when analyzed for mutation load, 
tumor stage, age, and drug class in cohort 1 (P = 0.02, HR = 
1.50, 95% CI 1.07 - 2.10) (table S2) and in cohort 2 (P = 0.028, 
HR = 1.31, 95% CI 1.03 - 1.67) (table S3). 
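
As a rough consistency check on the reported statistics, a two-sided P value can be approximated from a hazard ratio and its 95% confidence interval. The sketch below is not the Cox regression used in the study: it assumes a Wald-type interval that is symmetric on the log scale, and the function name `wald_p_from_hr_ci` is hypothetical.

```python
import math

def wald_p_from_hr_ci(hr, ci_low, ci_high, z_crit=1.959964):
    """Approximate two-sided P value from a hazard ratio and its 95% CI.

    Assumes the CI is symmetric on the log scale (Wald interval); this is a
    consistency check on published numbers, not a re-analysis of patient data.
    """
    se = (math.log(ci_high) - math.log(ci_low)) / (2 * z_crit)   # SE of log(HR)
    z = math.log(hr) / se
    return math.erfc(abs(z) / math.sqrt(2))                       # two-sided normal tail

p_cohort1 = wald_p_from_hr_ci(1.40, 1.02, 1.9)    # univariable values quoted for cohort 1
p_cohort2 = wald_p_from_hr_ci(1.31, 1.03, 1.70)   # univariable values quoted for cohort 2
```

Both approximations land near the reported P values without matching them exactly, which is expected because the published hazard ratios and interval limits are rounded.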

We next examined all 1,535 patients from cohort 1 and 2 
together, to determine whether the effect of homozygosity 
may be due to a single HLA-I locus, or a combination of dif- 
ferent loci. This analysis revealed that homozygosity at one 
HLA-I locus (A, or B, or C) was associated with significant 
reduction of overall survival (P = 0.003, HR = 1.38, 95% CI 
1.11 - 1.70) (Fig. 1C). Interestingly, the effect of homozygosity 
on survival due to specific HLA-I locus seemed mostly asso- 
ciated with HLA-B (P = 0.052, HR = 1.66, 95% CI 0.93 - 2.94) 
(Fig. 1C) and HLA-C (P = 0.004, HR = 1.60, 95% CI 1.16 - 2.21) 
(Fig. 1C). Of note, the number of patients available likely lim- 
ited the interpretability of analyses involving combinations 
of loci (e.g., HLA-A and -B). Our findings may be explained 
because HLA-B is generally expressed at higher levels on the 
cell surface than HLA-A and HLA-C, and because HLA-B al- 
leles bind to a greater diversity of peptides (38, 39). Amino 
acids that bind to the B pocket of HLA-A alleles are broadly 
hydrophobic. In contrast, the B pocket of HLA-B alleles can 
accommodate a greater variety of residues (29, 39). Interest- 
ingly, antigen-presenting cells express higher levels of HLA- 
C on the cell surface than other cell types (40), suggesting 
that heterozygous HLA-C may facilitate continuous CTL 
priming (41). 
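
The grouping used in these analyses, full heterozygosity versus homozygosity in at least one HLA-I locus, can be sketched as follows. The data layout, the function names, and the allele strings are illustrative assumptions, and comparing allele names at the typing resolution is assumed to be the criterion for homozygosity.

```python
def homozygous_loci(genotype):
    """Return the HLA-I loci at which both alleles are identical.

    `genotype` maps a locus name to a pair of allele names; the allele
    strings used below are illustrative placeholders, not patient data.
    """
    return [locus for locus, (a1, a2) in genotype.items() if a1 == a2]

def is_fully_heterozygous(genotype, loci=("A", "B", "C")):
    """True if the two alleles differ at every listed locus."""
    homozygous = set(homozygous_loci(genotype))
    return not any(locus in homozygous for locus in loci)

patient_1 = {"A": ("A*02:01", "A*01:01"), "B": ("B*15:01", "B*44:02"), "C": ("C*03:04", "C*05:01")}
patient_2 = {"A": ("A*02:01", "A*01:01"), "B": ("B*44:02", "B*44:02"), "C": ("C*03:04", "C*05:01")}

group_1 = "heterozygous" if is_fully_heterozygous(patient_1) else "homozygous in >= 1 locus"
group_2 = "heterozygous" if is_fully_heterozygous(patient_2) else "homozygous in >= 1 locus"
```

Returning the list of homozygous loci, rather than only a flag, also supports the per-locus comparisons (A, B, or C) described in the text.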

Previous reports have shown that the total number of so- 
matic coding mutations in a cancer genome correlates with 
response to ICB (7, 8, 10-12, 17). An explanation for this observation
is that the number of tumor mutations presented
on the cell surface increases the probability of neoantigen 
recognition by cytotoxic T cells (42). We found that HLA-I 
homozygosity and low mutation burden were strongly asso- 
ciated with decreased survival compared to patients who 
were heterozygous at each class I locus and whose tumors 
had high mutation burden, in cohort 1 (P = 0.003, HR = 2.03, 
95% CI 1.27 - 3.30) (Fig. 1D) and in cohort 2 (P < 0.0001, HR 
= 2.98, 95% CI 1.84 - 4.82) (Fig. 1E). Notably, the combined 
effect of HLA class I heterozygosity and mutation load on im- 
proved survival was greater compared with mutation load 
alone (Fig. 1, F and G). 

Previous work has reported loss of heterozygosity (LOH) 
of HLA class I genes in cancer (43, 44). We thus analyzed all 
tumor exomes from cohort 1 and identified 32 patients who 
were heterozygous at all HLA-I loci, but had LOH in at least 
one HLA-I locus in their tumors (table S1). Patients with LOH 
of HLA-I were associated with reduced survival (P = 0.05, HR 
= 1.60, 95% CI 1.03 - 2.43) (Fig. 1H). Furthermore, the effect 
of LOH of HLA-I on survival was greater in patients whose 
tumors contained low mutation load (P = 0.0006, HR = 3.68, 
95% CI 1.64 - 8.23) (Fig. 1I). Given that only a small fraction
of presented tumor mutations are immunogenic in cancer pa- 
tients (45, 46), our findings suggest that relatively small dif- 
ferences in the number of available HLA-I molecules in a 
given individual can present major challenges to effective 
anti-tumor T cell responses and efficacy to ICB. Furthermore, 
the demonstration of a significant survival advantage to 
HLA-I heterozygosity in patients treated with ICB both at the 
germline and somatic level, highlights its importance in the 
dynamic anti-tumor immune response and immune evasion. 

As an exploratory analysis, we also found that HLA class 
II homozygosity at HLA-DP was associated with reduced sur- 
vival (P = 0.018, HR = 1.45, 95% CI 1.06 - 2.00) (fig. S2A). 
Additionally, homozygosity at HLA-DPB locus was associated 
with decreased survival (P = 0.04, HR = 1.37, 95% CI 1.07 - 
1.87). This effect was independent of the associations of HLA- 
I homozygosity and mutation burden (tables S4 and S5). It is 
noteworthy that mismatched HLA-DP has been shown to be 
associated with graft-versus-host disease (47). 

Additionally, we employed next-generation deep sequenc- 
ing of T-cell receptor CDR3 regions (TCR-seq) (48, 49) from a 
subset of tumor samples collected on-therapy (4 weeks post- 
Nivolumab initiation) (17). We found significantly higher on- 
therapy clonality of TCR CDR3s in HLA heterozygous pa- 
tients compared to HLA homozygotes (in at least one class I 
locus or at HLA-DP) (Wilcoxon rank-sum test P = 0.0093) 
(Fig. 3F and table S1). To refine the interpretation of this re- 
sult with respect to the antigen-binding properties of the TCR 
repertoire, the clonality of CDR3s encoded by a single VJ cas- 
sette combination was analyzed individually (17, 50). Notably, 
higher on-therapy clonality of TCR CDR3s per VJ was observed
in HLA heterozygotes (Wilcoxon rank-sum test P =
0.023) (Fig. 3G). Altogether, these results indicate that the di- 
versity of HLA molecules in a given patient influences the se- 
lection and the resulting T-cell clonal expansion reactive 
against neoantigens following ICB (51). 
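
The clonality comparison above can be illustrated with a short sketch. The text here does not give the formula, so this assumes a commonly used definition: one minus the normalized Shannon entropy of clonotype frequencies. The function name and the read counts are hypothetical and are not taken from the study.

```python
import math

def clonality(clonotype_counts):
    """Return 1 - normalized Shannon entropy (an assumed definition).

    A value of 0 means all clonotypes are equally abundant; values near 1
    mean that a few clonotypes dominate the repertoire.
    """
    counts = [c for c in clonotype_counts if c > 0]
    if not counts:
        raise ValueError("at least one clonotype with a positive count is required")
    if len(counts) == 1:
        return 1.0  # a single clonotype is treated as fully clonal
    total = sum(counts)
    entropy = -sum((c / total) * math.log(c / total) for c in counts)
    return 1.0 - entropy / math.log(len(counts))

# Hypothetical CDR3 read counts for two repertoires
even_repertoire = [25, 25, 25, 25]
skewed_repertoire = [97, 1, 1, 1]
print(clonality(even_repertoire), clonality(skewed_repertoire))
```

A per-VJ analysis of the kind described would apply the same function separately to the CDR3 counts within each VJ cassette combination.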

To investigate the clinical relevance of individual HLA-I 
alleles after ICB therapy, we examined the effects of HLA-I 
supertypes on overall survival. Individual HLA-I alleles are 
classified into twelve discrete supertypes (52, 53), based upon 
similar peptide-anchor-binding specificities (26, 52, 53). 
These supertypes together cover most HLA-A and HLA-B al- 
leles found in distinct populations (52, 53). 

To assess the effect of HLA supertype on survival, we fo- 
cused on melanoma patients, as there were a sufficient num- 
ber of patients in the two patient sets for meaningful analysis. 
Based on the biological definition of supertypes, we classified 
the 27 HLA-A alleles present in the patients with melanoma 
into six A supertypes, and the 50 HLA-B alleles into six B supertypes
(Fig. 2A and table S1). We found two B supertypes
associated with survival outcome in patients with advanced 
melanoma treated with anti-CTLA-4. Patients with B44 su- 
perfamily alleles had significantly better survival (P = 0.01, 
HR = 0.61, 95% CI 0.42 - 0.89) (Table 1) and patients with 
B62 alleles had significantly reduced survival (P = 0.0007, HR 
= 2.29, 95% CI 1.40 - 3.74) (Table 1). The B44 supertype was 
present at a prevalence of 45% and the B62 supertype at 15% 
(Fig. 2A). We did not find any supertype significantly associ- 
ated with overall survival in patients with NSCLC, likely due 
to the limited sample size. 

We then examined whether these supertype associations 
were influenced by the presence of specific component HLA-I
alleles. The B44 association was influenced by HLA-B*18:01,
HLA-B*44:02, HLA-B*44:03, HLA-B*44:05, and HLA-B*50:01 
(P = 0.001, HR = 0.49, 95% CI 0.32 - 0.76) (Fig. 2C) (Table 1). 
The B62 association was significantly driven by HLA-B*15:01
(P = 0.002, HR = 2.21, 95% CI 1.33 - 3.70) (Fig. 3A)
(Table 1). Both of these B44 and B62 allele associations re- 
mained statistically significant (P = 0.01 and P = 0.02, respec- 
tively) after a Bonferroni correction. The variability in the 
effect on survival across these B44 alleles might be explained 
due to allele frequency in the cohort or due to particular dif- 
ferences in the peptide motifs inside or outside primary an- 
chor pockets (54, 55). 
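
As a worked illustration of the Bonferroni correction mentioned above, the sketch below multiplies each P value by the number of tests and caps the result at 1. The number of tests is not stated in this passage; the value 12 is an assumption chosen only because Table 1 lists twelve supertypes, and the function name is hypothetical.

```python
def bonferroni(p_values, n_tests=None):
    """Bonferroni-adjust P values: multiply by the number of tests, cap at 1."""
    m = len(p_values) if n_tests is None else n_tests
    if m < 1:
        raise ValueError("n_tests must be at least 1")
    return [min(1.0, p * m) for p in p_values]

# Unadjusted allele-level P values from Table 1 (B44 alleles, B*15:01).
# n_tests=12 is an assumption for illustration, not a value from the text.
adjusted = bonferroni([0.001, 0.002], n_tests=12)
print(adjusted)
```

Under that assumption the adjusted values are roughly 0.012 and 0.024, which is in line with the reported P = 0.01 and P = 0.02.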

In the independent cohort 2, melanoma patients treated 
with anti-PD-1 or anti-CTLA-4 who had these B44 supertype 
alleles had significantly better overall survival on univariate 
(P = 0.054, HR = 0.32, 95% CI 0.09 - 1.1) (Fig. 2, B and D), and 
multivariable analysis (tables S6 and S7). Furthermore, the 
effect of B44s on extended survival was greater when somatic 
mutational load was also considered in cohort 1 (P < 0.0001, 
HR = 0.23, 95% CI 0.13 - 0.41) (Fig. 2E) and in cohort 2 (P = 
0.023, HR = 0.13, 95% CI 0.02 - 1.03) (Fig. 2F). The combined
effect of the B44 alleles and mutation load was greater than 
simply considering mutation burden alone (Fig. 2, G and H). 
We note that, in general, outcomes of melanoma patients in 
cohort 2 tended to be better than in cohort 1 because patients 
who received ICB and were accrued to our protocol for MSK- 
IMPACT testing tended to have longer survival. Yet, despite 
this trend, we still observed a significant effect from the B44 
alleles. Notably, the B44 alleles did not associate with survival 
in patients with melanoma from The Cancer Genome Atlas 
(TCGA), suggesting that the presence of B44 was predictive 
of response to ICB and was not prognostic (Fig. 2I).

Most members of the B44 supertype share a preference 
for peptides with Glu (E) at anchor position P2, and polar and 
hydrophobic residues at the C terminus (54, 56) (Fig. 2J). In- 
terestingly, we found that one out of the six enriched amino 
acid mutations across these tumors was G > E (fig. S3). This 
observation suggests that there might be an enrichment of 
presentation of B44-restricted neoantigens. Additionally, a 
number of previously identified immunogenic antigens expressed
by melanomas are HLA-B44-restricted (Fig. 2J and
table S8), including the testis antigen MAGEA3, restricted to 
HLA-B*44:03 and HLA-B*18:01 (both members of B44), and 
a clonal immunogenic neoantigen (FAM3C: TESPFEQHI) 
identified in a melanoma patient with long-term response to 
CTLA-4 blockade from cohort 1 (table S8) (11, 13).

In contrast, the B62 association with poor survival driven 
by the HLA-B*15:01 allele was intriguing (Fig. 3A) (Table 1). 
In an exploratory analysis, we sought to determine whether 
any molecular features in HLA-B*15:01 are associated with its 
effect on survival. Out of all the HLA-B alleles that were avail- 
able for three-dimensional structural analysis (n = 119) (table 
S9), we identified three alleles at their highest resolutions, 
HLA-B*15:01, HLA-B*07:02, and HLA-B*53:01, as possessing 
a structural bridge in the peptide-binding groove (Arg62, 
Ile66, and Leu163) (Fig. 3, B and C).

We postulated that this specific structural feature may 
modulate the effective T cell recognition of neoepitopes pre- 
sented on HLA-B*15:01. To evaluate the validity of this hy- 
pothesis, we conducted molecular dynamics (MD) 
simulations following similar protocols used in previous 
studies (57-59). 

In the case of HLA-B*07:02 and HLA-B*53:01, MD simulations
demonstrated that the bound peptide expands the respective
HLA binding cleft, effectively breaking the bridge
(fig. S4, A to D). Conversely, in the HLA-B*15:01 molecule, the 
bridge was largely maintained with the peptide present, and 
the bridging residues were also made much less flexible (Fig. 
3, D and E). While the mean bridge separation remained 
nearly constant (~6 Å) in both systems of HLA-B*15:01 (Fig.
3E), the fluctuations in this distance were less dramatic in the 
peptide-bound complex. Altogether, these unique structural 
and dynamical elements of HLA-B*15:01 may impair the total
strength of the interaction with T-cell receptor for effective 
neoantigen recognition. However, further experimental work 
will be necessary to test this hypothesis. It is noteworthy that 
we found several mutations in genes that have been recently 
reported to contribute to HLA and cytolytic activity (4-6) (fig. 
S5). However, we did not find any particular gene mutation 
associated with decreased overall survival. 

Our findings reveal that HLA-I genes influence patient 
survival to ICB. Both patient-specific HLA-I genotype as well 
as somatic alterations in tumors impacted clinical outcome 
to ICB, suggesting these factors could be considered in the 
design of future clinical trials. The observation that the B44
supertype is associated with extended overall survival may provide an
opportunity for the development of therapeutic vaccines that 
potentially target immunodominant HLA-B44-restricted ne- 
oantigens expressed by melanomas. Our findings indicate 
that HLA-I homozygosity and LOH represent a genetic bar- 
rier to effective immunotherapy, and alternative ways to har- 
ness the immune system may be necessary to maximize 
clinical benefit. 
Fig. 1. Effect of HLA class I homozygosity on survival in patients treated with immune checkpoint inhibitors.
(A) Association between homozygosity in at least one HLA-I locus and reduced overall survival in cohort 1. (B)
Association between homozygosity in at least one HLA-I locus and reduced survival in cohort 2. (C) Association 
between HLA-I homozygosity and decreased survival from all 1,535 patients. Data show one or more HLA-I loci or 
individual loci (HLA-A, HLA-B, and HLA-C). Indicated are the number of patients and hazard ratio (HR). Horizontal 
lines represent the 95% confidence interval. P value was calculated using the Log-rank test. (D) Patients in cohort 1 
with heterozygosity at all HLA-I loci and a high mutation load (defined as >113 mutations) compared to patients that 
are homozygous for at least one HLA-I locus and have a low mutation load. (E) Patients in cohort 2 with
heterozygosity at all HLA-I loci and a high tumor mutational load (defined as >16.72 mutations) compared to patients 
that are homozygous in at least one HLA-I locus and have a low mutation load. (F) Distribution of hazard ratios to 
stratify cohort 1 patients based on tumor mutational load. The combined effect of HLA-I heterozygosity at all loci and
mutation load on improved survival was greater compared to mutation load alone. (G) Distribution of hazard ratios 
to stratify cohort 2 patients based on mutation load. A range of cutoffs across the quartiles of mutation load were 
used. P values were calculated using the Wilcoxon rank-sum test. (H) Survival analysis showing that LOH of
heterozygous germline HLA-I is associated with decreased overall survival in patients treated with ICB. (I) Survival
analysis showing that the effect of LOH of heterozygous germline HLA-I is greater in tumors with low mutation 
burden compared to tumors with high mutation load and without LOH. High mutation load is defined as >113 
mutations. 
Fig. 2. Influence of the HLA-B44 supertype on survival of melanoma patients treated with immune checkpoint
inhibitors. (A) Prevalence of the different HLA supertypes in patients with melanoma from cohort 1. (B) Prevalence 
of the different HLA supertypes in the patients with melanoma from cohort 2. (C and D) Survival analysis of patients 
possessing the B44 alleles [B44 (+)] compared with patients without the B44 alleles [B44 (-)] from cohort 1 (C) and
cohort 2 (D). (E and F) Survival analysis of patients with the B44 alleles and with high mutation burden versus
patients without B44 and with low mutation load, from cohort 1 (E) and cohort 2 (F). (G) Distribution of hazard ratios
to stratify cohort 1 patients based on mutation load. The combined effect of B44 and mutation load on increased 
survival was greater compared to simply considering mutation load alone. (H) Distribution of hazard ratios to stratify 
cohort 2 patients based on mutation load. A range of cutoffs across the quartiles of mutation load was used. P values 
were calculated using the Wilcoxon rank-sum test. (I) Survival analysis of melanoma patients with and without the
B44 alleles from the TCGA cohort. (J) Left: Example of peptide motif common among B44 alleles, docked in complex 
with HLA-B*44:02 based on an available crystal structure (PDB: 1M60). The five common residues (E2, I3, P4, V6,
and Y9) of the motif were reported in (56). Peptide residues are colored according to their properties as basic (red), 
acidic (blue), polar (green), or hydrophobic (grey). Center: Close up view of an example peptide conforming to the 
B44 motif (54, 56). Residues at positions 2 and 9 are important for anchoring the peptide in the HLA binding groove 
(54). Right: Alignment between B44 peptide motif and known immunogenic neoantigens (table S8) restricted to 
B44 expressed by melanomas. All neoepitopes feature Glu (E) at position 2; neoantigens are also either identical or 
similar to the motif at one or two additional positions. The neoantigen, FAM3C: TESPFEQHI, was identified in a 
melanoma patient with long-term response to anti-CTLA-4 from cohort 1. Sequence similarity was determined using 
standard residue classes (GAVLI, FYW, CM, ST, KRH, DENQ, and P). 
Fig. 3. Effect of the HLA-B*15:01 allele on overall survival of melanoma patients treated with immune
checkpoint inhibitors. (A) Survival analysis showing reduced survival in ICB-treated melanoma patients from 
cohort 1 with and without the HLA-B*15:01 allele. (B) Overview of the three-dimensional structure of the peptide- 
binding groove of HLA-B*15:01, light purple; bound peptide, yellow; bridging residues, light pink. (C) Side view of the 
bridge-sequestration effect over bound-peptide residue positions P2 (light blue) and P3 (red). (D) MD simulation 
snapshots of both the isolated HLA-B*15:01 molecule and its complex with a 9-mer UBCH6 peptide; each trajectory
was run over the course of 500 ns of simulation time. (E) Observables from the MD simulations described in D. The 
mean bridge distances in the HLA-B*15:01 molecule and in the HLA-B*15:01-peptide complex are comparable. The 
residue-position root mean square fluctuations (RMSFs) indicate that each of the bridging residues becomes more 
rigid in the presence of the peptide. (F) On-therapy clonality of TCR CDR3s between HLA heterozygous patients and 
patients who are HLA homozygous (in at least one class I locus or at HLA-DP). (G) On-therapy clonality of TCR CDR3s
per VJ combination between HLA heterozygous patients and patients with HLA homozygosity (in at least one class
I locus or at HLA-DP).
Table 1. Association of HLA supertype with overall survival of melanoma patients treated with immune checkpoint
blockade. Data show the influence of specific HLA class I alleles on patient survival.

HLA class I supertype                                   Frequency   HR (95% CI)        P value
A24                                                     0.29        0.67 (0.44-1.03)   0.07
A01                                                     0.59        0.87 (0.60-1.27)   0.47
A03                                                     0.52        1.39 (0.96-2.03)   0.08
A02                                                     0.5         1.13 (0.76-1.63)   0.53
B58                                                     0.09        0.98 (0.51-1.88)   0.96
B62                                                     0.15        2.29 (1.40-3.74)   0.0007
B27                                                     0.29        1.09 (0.73-1.63)   0.67
B44                                                     0.45        0.61 (0.42-0.89)   0.009
B07                                                     0.54        1.35 (0.92-1.97)   0.12
B08                                                     0.2         0.85 (0.52-1.39)   0.51
A01A03                                                  0.04        1.20 (0.49-2.94)   0.69
A01A24                                                  0.09        0.89 (0.43-1.83)   0.76

Alleles influencing the significant associations
B44s, B*18:01, B*44:02, B*44:03, B*44:05, B*50:01       0.34        0.49 (0.32-0.76)   0.001
B62s, B*15:01                                           0.13        2.21 (1.33-3.70)   0.002
The piRNA targeting rules and the resistance to piRNA silencing in endogenous genes

Donglei Zhang, Shikui Tu, Michael Stubna, Wei-Sheng Wu, Wei-Che Huang,
Zhiping Weng, Heng-Chi Lee


Piwi-interacting RNAs (piRNAs) silence transposons to safeguard genome integrity in
animals. However, the functions of the many piRNAs that do not map to transposons remain
unknown. Here, we show that piRNA targeting in Caenorhabditis elegans can tolerate
a few mismatches but prefers perfect pairing at the seed region. The broad targeting capacity
of piRNAs underlies the germline silencing of transgenes in C. elegans. Transgenes
engineered to avoid piRNA recognition are stably expressed. Many endogenous
germline-expressed genes also contain predicted piRNA targeting sites, and periodic
An/Tn clusters (PATCs) are an intrinsic signal that provides resistance to piRNA silencing.
Together, our study reveals the piRNA targeting rules and highlights a distinct strategy
that C. elegans uses to distinguish endogenous from foreign nucleic acids.


P-element induced wimpy testis (Piwi) proteins
and their associated Piwi-interacting
RNAs (piRNAs) function as a guardian of 
animal genomes through transposon silenc- 
ing in various animals (1-5). However, many 
animals produce piRNAs that do not match trans- 
poson sequences. For example, the vast majority of 
the 15,000 piRNAs encoded by the Caenorhabditis 
elegans genome do not exhibit extensive complementarity
to transposons (3, 4, 6). In mice, tens of
thousands of distinct piRNAs produced at the
pachytene stage during spermatogenesis do not 
map to transposons (7). These observations sug- 
gest additional targets and functions of piRNAs. 

Identification of piRNA targets and the piRNA 
targeting rules has proven to be rather difficult. 
Cross-linking immunoprecipitation (CLIP) analy- 
ses of Piwi proteins suggest that they associate with 
diverse mRNAs (8-10). However, because diverse 
piRNAs engage with many mRNAs, it is difficult 


Fig. 1. 22G-RNA loci as a proxy to identify the targets
of specific piRNAs. (A) An example of 22G-RNA
distributions at one of the RNA targets of the
synthetic piRNA (GFP-targeting piRNA #1) in the
indicated strains with biological replicates. Each pink bar
indicates the first nucleotide position and abundance
of 22G-RNAs. The red bar indicates the position targeted
by the synthetic piRNA. rpm, reads per million.
(B) A scatter plot showing the abundance of 22G-RNAs
around each potential targeting site [100-nucleotide
(nt) window centered on each target site] of the
synthetic piRNA (GFP-targeting #1) in the control strain
and in the strain expressing the synthetic piRNA. The
potential targeting sites are sites of RNA transcripts that
pair to the specific piRNA with six or fewer mismatches.
Marked in red are sites at which 22G-RNA levels increased
more than fourfold in the strain expressing the synthetic
piRNA relative to the control strain. (C) A scatter plot
showing the abundance of 22G-RNAs at each potential
targeting site of 21U-RNA-X1 in the N2 (wild-type) strain
and in the strain containing a deletion of the 21U-RNA-X1
coding loci. Marked in green are sites at which 22G-RNA
levels decreased more than 75% in the strain lacking
21U-RNA-X1 relative to the N2 wild type. (D) The pairing
between piRNAs and identified targets. (Top) Examples
of pairings between the piRNAs and their targets.
(Bottom) A bar graph showing the percentage of
base pairing at each position within the piRNAs with
all 17 identified targets. GU wobble pairing is considered
as paired here in order to highlight the near-perfect
pairing at the seed region when all GU pairs are allowed.
to infer the target of a given piRNA from these
CLIP analyses. Therefore, additional approaches 
are required to identify piRNA sites in vivo. In 
some cases, targets of piRNAs can be inferred if 
the mRNA target is cleaved by Piwi (11, 12). However,
these cleaved mRNAs likely represent only a
fraction of piRNA targets in vivo because the 
slicer activity of Piwi is dispensable for silencing 
in some animals, including C. elegans (13-16). Because
only a few piRNA targets other than transposons
have been identified, the piRNA targeting
rules remain undefined, and both sequence-specific
and sequence-nonspecific functions of the Piwi/ 
piRNA complex have been proposed (8-10, 12, 17, 18). 

To gain insight into the piRNA targeting mech- 
anism, we identified the targets of a single piRNA 
and examined how the piRNA recognizes its targets. 
In C. elegans, piRNA targeting leads to the recruit- 
ment of RNA-dependent RNA polymerases (RdRPs) 
that produce secondary small RNAs named 22G-RNAs
(fig. S1A) (3, 13, 15). These 22G-RNAs are
loaded onto worm-specific Argonautes (WAGOs) 
to induce gene silencing (19-21). Because these 
22G-RNAs are produced around the targeting 
site, the 22G-RNAs can serve as a “signature” for
piRNA targeting sites in vivo (13, 15). Therefore, 
we identified the targets of a piRNA by examining 
the 22G-RNA species gained in animals expressing 
a synthetic piRNA or the 22G-RNA species lost in 
animals carrying a deletion of a specific piRNA 
(fig. S1B). We obtained animals expressing a syn- 
thetic piRNA or losing an endogenous piRNA 
through a CRISPR/Cas9-based genome-editing 
strategy that modified the locus of an endogenous 
piRNA (fig. S1B). Small-RNA sequencing confirmed
the expression or loss of specific piRNAs in these 
animals (fig. S1, C to F) and was used to identify 
changes in 22G-RNA levels. Together, we identified 
six RNA targets in the animals producing the syn- 
thetic piRNAs and 11 RNA targets in animals lack- 
ing the endogenous piRNAs (Fig. 1, A to C, and 
table S1). We noticed that a region of the piRNAs, 
from the second to seventh nucleotide, pairs well 
to the identified targeting sites (Fig. 1D). This 
implies a critical role for the pairing of this re- 
gion in piRNA targeting, which we define as the 
piRNA seed. The piRNA seed is reminiscent of the 
microRNA (miRNA) seed, which is essential for 
miRNA target recognition (22). In addition, we 
observed apparent pairing outside of the piRNA 
seed region (Fig. 1D and fig. S1G). These observations
suggest that base pairing outside of the seed
region also contributes to piRNA target recognition,
but a few mismatches can be tolerated. Furthermore,
we noticed that GU wobble pairs are
overrepresented, relative to other non-Watson-Crick
pairs, in these targeting events (table S1).
Last, the first nucleotide does not appear to con- 
tribute to piRNA targeting (Fig. 1D). 
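
The target-calling step described above (22G-RNAs gained with a synthetic piRNA, or lost when an endogenous piRNA is deleted) can be sketched as follows. The thresholds come from the Fig. 1 legend (more than fourfold increase; more than 75% decrease). The function names, the pseudocount, and the example values are assumptions added for illustration and do not come from the study.

```python
def gained_sites(control_rpm, synthetic_rpm, min_fold=4.0, pseudocount=0.1):
    """Sites whose 22G-RNA level rises more than `min_fold` with the synthetic piRNA.

    Inputs map a site identifier to 22G-RNA abundance (reads per million)
    in the window around that site. The pseudocount (an assumption) avoids
    division by zero for sites with no reads in the control strain.
    """
    return [
        site
        for site, ctrl in control_rpm.items()
        if (synthetic_rpm.get(site, 0.0) + pseudocount) / (ctrl + pseudocount) > min_fold
    ]

def lost_sites(wildtype_rpm, deletion_rpm, min_drop=0.75):
    """Sites whose 22G-RNA level falls by more than `min_drop` in the deletion strain."""
    return [
        site
        for site, wt in wildtype_rpm.items()
        if wt > 0 and (wt - deletion_rpm.get(site, 0.0)) / wt > min_drop
    ]

# Hypothetical abundances (rpm) at candidate sites
print(gained_sites({"siteA": 1.0, "siteB": 2.0}, {"siteA": 10.0, "siteB": 3.0}))
print(lost_sites({"siteX": 8.0, "siteY": 8.0}, {"siteX": 1.0, "siteY": 4.0}))
```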

In light of these findings, we developed a piRNA 
reporter assay so as to gain further insights into 
the piRNA targeting rules. In this assay, we examined
whether synthetic gfp-targeting piRNAs with
various mismatches to the green fluorescent pro- 


Fig. 2. A piRNA reporter assay to investigate
the piRNA targeting rules. (A) Fluorescence
micrographs showing the expression of
transgene GFP(dpiRNA)::CDK-1 in worms
carrying an extrachromosomal array
to express the gfp-targeting piRNA with
perfect pairing or in the control strain that
does not express the synthetic piRNA.
Arrows indicate the germline nuclei with
expressed transgene. Circles indicate the
germline nuclei with silenced transgene. The
unmarked green fluorescent signals are
autofluorescent signals generated from worm
intestinal granules. (B) The sequences of
the gfp-targeting piRNAs, the positions of the
mismatches (red), and their effects on the
expression of GFP(dpiRNA)::CDK-1. Asterisk
indicates gfp-targeting piRNAs produced by
gene-edited animals modified at an endogenous
piRNA locus (21U-5499). (C) Percentage of
transgenic animals that exhibit the silencing
of GFP(dpiRNA)::CDK-1 in animals expressing
specific gfp-targeting piRNAs. At least eight
independent strains carrying extrachromosomal
arrays (roller) are examined for each piRNA.
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tein (GFP) sequence can trigger the silencing of an 
expressed GFP(dpiRNA)::CDK-1 transgene (dpiRNA
stands for depletion of piRNA targeting sites, in 
which the GFP sequence has been recoded to avoid 
silencing by endogenous piRNAs) (Fig. 2A, left, and 
fig. S2A). Because we noticed that the synthetic 
piRNAs can be produced from animals carrying 
extrachromosomal arrays with synthetic piRNA 
loci, we chose this method to systematically produce
various gfp-targeting piRNAs (fig. S2A). We observed
that GFP(dpiRNA)::CDK-1 was silenced in the
animals expressing synthetic piRNAs that are per- 
fectly complementary to GFP mRNAs or contain 
two mismatches outside of the piRNA seed region 
(Fig. 2, A to C, and fig. S2B). On the contrary, we 
failed to observe the silencing of GFP(dpiRNA)::CDK-1
when one or two mismatches were located at 
the piRNA seed region (Fig. 2, B and C, and fig. 
S2B). In addition, our reporter assay suggests that 
piRNAs tolerate up to three nonseed mismatches 
but not RNA bulges (fig. S2, C to E). We also ob- 
served that one GU wobble pair is tolerated in the 
seed region, and GU pairs are moderately more 
tolerated than mismatches in the nonseed region 
(fig. S2, C and D). Last, we obtained consistent 
results in our reporter analyses using gene-edited 
worms that express synthetic gfp-targeting piRNAs
from an endogenous piRNA locus (fig. S2F). Over- 
all, our reporter assay revealed a similar but more 
stringent piRNA-targeting logic than that from 
our analyses of synthetic piRNA targets. Together, 
our analyses suggest that piRNA targeting in 
C. elegans prefers near-perfect pairing at the piRNA 
seed region. In addition, supplementary pairing 
outside of the seed region also contributes to 
piRNA targeting, but a few mismatches are tolerated.

It has been known for decades that transgenes 
carrying various foreign sequences, such as GFP 
or mCherry, are frequently silenced in the germ- 
line of C. elegans (23, 24). A previous study has 
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shown that Piwi protein PRG-1 is required for 
the silencing of the transgene GFP::CDK-1 (19). If 
piRNAs recognize GFP sequences, then removal 
of piRNA targeting sites from the GFP sequences 
should allow transgene GFP::CDK-1 expression 
in the germline. To predict more piRNA target- 
ing sites on transgenes, we used the relaxed 
piRNA targeting criteria similar to those derived 
from our analyses of synthetic piRNA targets 
(Data File S1, algorithms of target prediction). 
These criteria predicted 17 piRNA targeting sites 
on GFP mRNA (Fig. 3A and table S2). We intro- 
duced silent mutations in the GFP sequences so 
that we no longer identified piRNA targeting 
sites, yielding the recoded GFP(dpiRNA) sequences.
Remarkably, although the GFP::CDK-1 transgene 
is always silenced in the germline of wild-type 
animals, we observed strong GFP expression from 
all five independent transgenic strains that we 
obtained with recoded GFP(dpiRNA)::CDK-1 inserted
at the same locus (Fig. 3B). 
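
The relaxed criteria used for transgene site prediction can be written as a small predicate. This is a minimal sketch based on the wording of the Fig. 3 legend, not the algorithm distributed in Data File S1. It assumes the seed spans piRNA positions 2 to 7, that "up to one non-GU mismatch" with a seed mismatch refers to one further mismatch outside the seed, and that the site has the same length as the piRNA. All function names and the example sequence are hypothetical.

```python
_COMPLEMENT = {"A": "U", "U": "A", "G": "C", "C": "G"}
_WATSON_CRICK = {("A", "U"), ("U", "A"), ("G", "C"), ("C", "G")}
_GU_WOBBLE = {("G", "U"), ("U", "G")}

def reverse_complement(rna):
    """Reverse complement of an RNA (or DNA) sequence, returned as RNA."""
    rna = rna.upper().replace("T", "U")
    return "".join(_COMPLEMENT[base] for base in reversed(rna))

def pairing_states(pirna, target_site):
    """Classify each piRNA position as 'wc', 'gu', or 'mm'.

    Both sequences are given 5'->3'; the target is reversed so that
    index i of each string refers to the same antiparallel base pair.
    """
    pirna = pirna.upper().replace("T", "U")
    target = target_site.upper().replace("T", "U")[::-1]
    if len(pirna) != len(target):
        raise ValueError("piRNA and target site must have the same length")
    states = []
    for p, t in zip(pirna, target):
        if (p, t) in _WATSON_CRICK:
            states.append("wc")
        elif (p, t) in _GU_WOBBLE:
            states.append("gu")  # counted as paired under the relaxed criteria
        else:
            states.append("mm")
    return states

def is_relaxed_site(pirna, target_site, seed=(2, 7)):
    """Apply the relaxed criteria; piRNA position 1 is ignored."""
    states = pairing_states(pirna, target_site)
    seed_mm = sum(1 for pos, s in enumerate(states, start=1)
                  if seed[0] <= pos <= seed[1] and s == "mm")
    nonseed_mm = sum(1 for pos, s in enumerate(states, start=1)
                     if pos > seed[1] and s == "mm")
    if seed_mm == 0:
        return nonseed_mm <= 3
    if seed_mm == 1:
        return nonseed_mm <= 1
    return False

# Hypothetical 21-nt piRNA and its perfectly complementary site
pirna = "UGAACGUACGUAGCUAGCUAG"
print(is_relaxed_site(pirna, reverse_complement(pirna)))
```

Scanning a transcript would slide a window of the piRNA's length along the mRNA and call `is_relaxed_site` on each window; recoding then amounts to introducing silent mutations until no window passes.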

To test whether our approach can be gen- 
erally applied to other transgenes in order to 
avoid gene silencing, we chose to modify the 
mCherry-tagged C-terminal region of Anillin 
(mCherry::ANI-1(681-1159)), another transgene that is
always silenced in the germline (25). We predicted 
10 piRNA targeting sites in mCherry mRNA and 
introduced silent mutations to disrupt predicted 
piRNA targeting sites (Fig. 3C and table S2). Whereas
the original mCherry::ANI-1(681-1159) transgene was
silenced in all six transgenic lines, the modified
mCherry(dpiRNA)::ANI-1(681-1159) was robustly expressed
at the cleavage furrow of the one-cell embryo in all 
six transgenic lines we obtained (Fig. 3D). 

As a last test, we applied this approach to mod- 
ify Cas9 sequences. Transgenic C. elegans strains 
stably expressing Cas9 have not been successfully 
obtained (26). Again, we introduced silent mutations 


in order to remove all predicted piRNA targeting 


Fig. 3. Silencing-prone transgenes can be
expressed in the germline by avoiding piRNA
targeting. (A) Predicted piRNA sites in GFP
mRNA sequence. The numbers of piRNA
sites that contain different types of mismatches
are shown. The relaxed criteria are used to
predict piRNA sites on transgenes: All GU wobble
pairing is allowed (considered as paired), and up to
three non-GU mismatches are allowed when sites
have perfect seed pairing, or up to one non-GU
mismatch is allowed when sites have one
non-GU mismatch in the seed region. The
mismatch at the first nucleotide of a piRNA is not
counted or considered. (B) The expression of
original GFP::CDK-1 that contains the predicted
piRNA targeting sites, or the modified
GFP(dpiRNA)::CDK-1 where all predicted piRNA
sites have been removed by introducing
silent mutations (right). Arrows indicate the
germline nuclei with expressed GFP(dpiRNA)::CDK-1.
Circles indicate the germline nuclei with
silenced GFP::CDK-1. (C) Predicted piRNA sites
in mCherry mRNA sequence (left). (D) The expression
of original mCherry::ANI-1(681-1159) that contains the
predicted piRNA targeting sites, or the modified
mCherry(dpiRNA)::ANI-1(681-1159) in which the predicted
piRNA sites have been removed by introducing
silent mutations (right). Arrows indicate the
expression of mCherry(dpiRNA)::ANI-1(681-1159) at cleavage
furrows of the one-cell embryo. (E) Predicted
piRNA sites in Cas9 mRNA sequence. (F) A schematic
showing the procedure followed to examine if
genome editing occurs in transgenic animals that
carry the original or modified Cas9 transgenes.
Plasmids containing unc-22 sgRNA and rol-6(su1006)
dominant transformation marker plasmid are
coinjected into transgenic animals that have
been carrying the Cas9 transgene for more than
five generations. F1 transformed roller animals
are picked, and their F2 progeny are scored
for unc-22 gene editing through twitcher phenotype.
(G) Sequences of various unc-22 edited alleles
obtained in the animals carrying the modified Cas9
transgene injected with plasmid encoding unc-22
sgRNA. Indels are highlighted in red.


sites (Fig. 3E and table S2) and obtained trans- 
genic animals carrying the original or the modified 
Cas9 transgene. To test whether Cas9 is stably ex- 
pressed, we injected the transgenic animals with an 
unc-22 single-guide RNA (sgRNA)-expressing plasmid
and a rol-6(su1006) plasmid as a dominant
transformation marker (Fig. 3F). The animals carrying
unc-22 mutations exhibit a visible twitcher
phenotype and can be easily identified (27). Out of
30 F1 transformed progeny (roller), nine animals
produced F2 twitcher progeny from animals carrying
the modified Cas9(dpiRNA) transgene, whereas
no F2 twitcher progeny were observed from animals
carrying the original Cas9 transgene. DNA sequencing
of these F2 twitcher animals confirmed
that they carry various unc-22-edited alleles (Fig.
3G). These observations functionally demonstrate
that the modified Cas9(dpiRNA) transgene is stably
expressed and thus can create edited alleles. Taken 
together, these experiments verify that our pre- 
dictions of piRNA targeting sites encompass the 
critical sites that trigger gene silencing. 

We next wondered whether endogenous germ- 
line genes have evolved to avoid piRNA recognition. 
Previous studies have shown that most C. elegans 
germline transcripts are targeted by either WAGO 
Argonaute-associated 22G-RNA, which correlates 
with silencing of the transcript, or CSR-1 Argonaute- 
associated 22G-RNA, which correlates with expres- 
sion of the transcript (28, 29). Using stringent piRNA 
targeting criteria corresponding to the ones de- 
rived from our reporter analyses, we predicted that 
around half of germline-expressed genes (CSR-1
targets), as well as germline-silenced genes (WAGO-1
targets), contain at least one piRNA targeting site
(Fig. 4A and fig. S3A), which is sufficient for silencing,
at least in our gfp reporter assay. In addition,
the density of piRNA targeting sites in germline- 
expressed genes is only slightly less than that of 
somatic-specific genes and control sequences (fig. 
S3B). Although the germline-silenced genes con- 
tain more predicted piRNA sites than germline- 
expressed genes, such differences alone cannot 
explain why only one set of genes is silenced (fig. 
S3B). Taken together, these predictions implied that 
some endogenously expressed germline genes are 
resistant to piRNA silencing. To test this hypothesis, 
we obtained animals that produce synthetic piRNAs 
Fig. 4. Germline-expressed genes exhibit resistance to piRNA
silencing through their intrinsic signals, such as PATCs. (A) Numbers 
of predicted piRNA sites on germline-expressed RNA transcripts. To 
predict more confident targeting sites, the stringent piRNA targeting 
criteria are used here, in which up to one GU wobble pair was allowed in 
the seed region, and overall only up to two mismatches plus an additional 
GU mismatch were allowed. In addition, the mismatch at the first 
nucleotide of the piRNA is not counted or considered. The RNA targets of 
CSR-1 Argonaute (CSR-1 targets) are used to define the germline- 
expressed genes. (B) Quantitative reverse transcription polymerase chain 
reaction measurements of (left) the abundance of the synthetic piRNAs 
in comparison with the level of endogenous 21U-5499 (value = 1) in the 
control strain and (right) the expression levels of corresponding mRNA 
targets in the indicated strains. nop-1—, cdk-1-, and oma-I-targeting piRNAs 
were produced by gene-edited animals, whereas pie-1-targeting piRNAs 
were produced by animals carrying extrachromosomal arrays. Error bars 
represent standard error of the mean from biological duplicated samples. 
The statistics for synthetic piRNA expression were calculated by comparing 
the levels of specific piRNAs and 21U-5499. n.s., not significant; *P < 0.05, 
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**P < 0.01, ***P < 0.001, Student's t test. (©) A box and whisker blot 
showing the density of PATC in the germline-specific and somatic-specific 
genes. ***P < 0.001, Wilcoxon rank sum test. (D) The density of 22G-RNAs 
within a 100-nt window around predicted piRNA target sites of germline- 
specific transcripts with high PATC density (PATC > 50) or low PATC 
density (PATC < 10). The plots are centered at the sequence targeted 

by piRNAs (green). The stringent piRNA targeting criteria were used 

here to predict piRNA target sites. n = number of predicted piRNA sites. 
(E) The box-and-whisker plots showing the number of predicted piRNA- 
targeted sites on germline-expressed genes that contain the indicated 
range of PATC density. The stringent piRNA targeting criteria were used 
here to predict piRNA target sites. n.s., not significant; ***P < 0.001, 
Wilcoxon rank sum test. (F) Fluorescence micrographs showing the 
expression of the original mCherry::ANI-1°!"9 harboring synthetic introns 
(no PATC) and mCherry™°::ANI-198!4459 harboring PATC-containing 
introns. (G) 22G-RNA distribution at mCherry coding sequence of the 
indicated transgenes. Each bar indicates the first nucleotide position and 
abundance of 22G-RNAs. The red bars mark the location of piRNA 
targeting sites predicted by using the relaxed piRNA targeting criteria. 
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that are perfectly complementary to several of these 
genes, including pie-I, nop-I, cdk-1, and oma-I. We 
engineered these piRNAs using the same locus 
(21U-5499) we used for producing g/p-targeting 
piRNAs, and these synthetic piRNAs were ex- 
pressed at similar levels (Fig. 4B). In the animals 
expressing synthetic piRNAs targeting endoge- 
nous genes, we did not observe a reduction of 
mRNA levels or the phenotypes associated with 
silencing of these genes (Fig. 4B and fig. S4, A to 
D). In addition, no phenotype associated with 
silencing was observed in animals expressing 
any of six additional synthetic piRNAs that tar- 
get various regions of the pie-I or nop-I1 genes 
(fig. S4, A and B). This is in stark contrast to the 
animals expressing seven distinct g/p or mCherry- 
targeting piRNAs, which all trigger potent silenc- 
ing of GFP*®™A::CDK-1 or mCherry®?!™™4ANT-1, 
respectively (Fig. 4B and figs. $4, E and F). To- 
gether, our results suggested that at least some 
endogenous germline genes exhibit resistance to 
piRNA-mediated gene silencing in C. elegans. 
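The stringent targeting criteria described in the Fig. 4A legend can be sketched as a simple pairing check. This is only an illustrative reading of the legend, not the authors' prediction pipeline: the function names are hypothetical, the seed is assumed to span piRNA positions 2 to 7, and "two mismatches plus an additional GU" is interpreted as applying to the positions outside the seed.

```python
# Illustrative sketch only; names and the seed range are assumptions.
WATSON_CRICK = {("A", "U"), ("U", "A"), ("G", "C"), ("C", "G")}
GU_WOBBLE = {("G", "U"), ("U", "G")}
COMPLEMENT = {"A": "U", "U": "A", "G": "C", "C": "G"}


def reverse_complement(rna):
    """Return the reverse complement of an RNA sequence (5'->3')."""
    return "".join(COMPLEMENT[base] for base in reversed(rna.upper()))


def pairing_states(pirna, target_site):
    """Classify each piRNA position as 'match', 'gu', or 'mismatch'.

    Both sequences are written 5'->3'; the target is reversed so that
    piRNA position 1 pairs with the last nucleotide of the site.
    """
    if len(pirna) != len(target_site):
        raise ValueError("piRNA and target site must have the same length")
    aligned_target = target_site.upper()[::-1]
    states = []
    for p_base, t_base in zip(pirna.upper(), aligned_target):
        if (p_base, t_base) in WATSON_CRICK:
            states.append("match")
        elif (p_base, t_base) in GU_WOBBLE:
            states.append("gu")
        else:
            states.append("mismatch")
    return states


def passes_stringent_criteria(pirna, target_site, seed=(2, 7)):
    """Apply the assumed reading of the stringent criteria.

    Position 1 is ignored; the seed (assumed to start at position 2)
    allows no mismatch and at most one GU pair; the remaining positions
    allow at most two mismatches and at most one GU pair.
    """
    states = pairing_states(pirna, target_site)
    seed_states = states[seed[0] - 1:seed[1]]
    rest_states = states[seed[1]:]
    return (
        seed_states.count("mismatch") == 0
        and seed_states.count("gu") <= 1
        and rest_states.count("mismatch") <= 2
        and rest_states.count("gu") <= 1
    )


if __name__ == "__main__":
    example_pirna = "UGACUAGCUAGCUAGCUAGCA"  # made-up 21-nt sequence
    example_site = reverse_complement(example_pirna)
    print(passes_stringent_criteria(example_pirna, example_site))
```

Changing the `seed` argument or the per-region limits is enough to explore a more relaxed rule set of the kind mentioned in the Fig. 4G legend.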

Previous studies have proposed that CSR-1 
Argonaute-associated 22G-RNAs may form an 
epigenetic memory of “self” to promote gene expression
in the germline (30, 31). We therefore examined
whether the nop-1-targeting piRNA can
trigger nop-1 silencing in csr-1 mutants. In
csr-1 F2 one-cell embryos, with or without
csr-1 RNAi treatment, we did not observe that the
synthetic piRNA conferred the phenotype associated
with nop-1 silencing (fig. S5A). To further test
whether the piRNA resistance of endogenous genes 
is mediated by epigenetic signals, we used a Cas9- 
based gene-editing strategy to delete the nop-1 
gene and its untranslated regions (UTRs) from the 
genome, which would remove all chromatin-based 
signals as well as the DNA/RNA templates that 
are required to produce CSR-1 22G-RNAs targeting 
nop-1 (fig. S5B). We then reinserted the nop-1 gene
back to its original locus or to the locus where our
silenced transgenes are inserted. The reinserted
nop-1 gene remained resistant to silencing by
endogenous or synthetic nop-1-targeting piRNA
(fig. S6, C and D). Although these data represented 
negative results, our analyses provided no evidence 
for epigenetic mechanisms in licensing germ- 
line gene expression. 

We therefore investigated whether intrinsic se- 
quences of germline genes provide resistance to 
piRNA silencing. One such candidate is 10-base pe- 
riodic An/Tn clusters (PATCs), an intrinsic DNA se- 
quence element found in the introns or promoters 
of some germline genes in C. elegans (32). A recent 
study has reported that PATCs can promote the ex- 
pression of transgenes inserted at heterochromatin 
in C. elegans, but whether PATCs can provide re- 
sistance to piRNA silencing has not been ex- 
plored (33). We found that PATCs are enriched in 
germline genes and particularly enriched in the 
germline-expressed genes (Fig. 4C and fig. S6, A 
and B). To examine the global effect of PATCs on 
piRNA silencing, we compared the local 22G-RNA 
distribution at predicted piRNA sites in germline- 
specific transcripts with high or low PATC density. 
We observed that local 22G-RNAs accumulated
around the targets only for genes with low, but not
high, PATC density (Fig. 4D and fig. S6, C and D).
These observations imply that PATCs negatively 
affect the ability of the piRNA pathway to induce 
and/or maintain 22G-RNAs at the piRNA targeting 
sites. Furthermore, we observed that germline- 
specific genes of higher PATC density contain more 
predicted piRNA targeting sites than genes with 
lower PATC density (Fig. 4E and fig. S7, A and B), 
implying that PATCs allow the expression of germ- 
line genes despite harboring piRNA targeting sites. 
Last, if PATCs can provide resistance to piRNA si- 
lencing, insertion of PATC introns to a silencing- 
prone transgene should license its expression. 
Indeed, replacing the mCherry introns of the
mCherry::ANI-1 transgene with PATC-containing
introns from the smu-1 gene led to the stable expression
of the transgene (Fig. 4F). Small-RNA
sequencing further showed that dramatically fewer
mCherry antisense 22G-RNAs are produced in the
worms carrying the PATC-containing mCherry::ANI-1
transgene than in those carrying the original
mCherry::ANI-1 transgene
(Fig. 4G). Together, our findings suggest that
PATCs act as a licensing signal that provides re- 
sistance to piRNA silencing. 
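The comparison of local 22G-RNA levels around predicted piRNA sites (Fig. 4D) amounts to averaging small-RNA signal in a fixed window centered on each site. A minimal sketch of that aggregation follows, assuming a hypothetical input format (a mapping from transcript position to 22G-RNA 5'-end abundance, plus a list of site centers); it is not the authors' analysis code.

```python
# Illustrative sketch; the input format and function name are assumptions.
def windowed_22g_profile(read_starts, site_centers, half_window=50):
    """Average 22G-RNA 5'-end abundance at each offset around target sites.

    read_starts: dict mapping transcript position -> 22G-RNA abundance.
    site_centers: positions of predicted piRNA target sites.
    half_window: 50 gives a window of roughly 100 nt, as in Fig. 4D.
    """
    offsets = range(-half_window, half_window + 1)
    totals = {offset: 0.0 for offset in offsets}
    for center in site_centers:
        for offset in offsets:
            totals[offset] += read_starts.get(center + offset, 0.0)
    n_sites = len(site_centers)
    if n_sites == 0:
        return totals
    return {offset: total / n_sites for offset, total in totals.items()}


if __name__ == "__main__":
    # Made-up positions and abundances, purely to show the call.
    toy_reads = {100: 4.0, 110: 2.0}
    toy_sites = [100, 120]
    profile = windowed_22g_profile(toy_reads, toy_sites, half_window=10)
    print(profile[0], profile[-10], profile[10])
```

Running the same function separately on sites from high-PATC and low-PATC transcripts would give two profiles that can be plotted side by side.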

Overall, our study revealed the piRNA targeting 
logic in C. elegans. In addition, our research suggested 
that diverse piRNAs can recognize and silence var- 
ious foreign nucleic acids because of their broad 
targeting capacity. Because several different modes 
of miRNA targeting have been described in ani- 
mals and plants (22, 34-38), additional modes of 
piRNA-targeting are likely to exist as well. Nonethe- 
less, our study demonstrated that piRNA-mediated 
gene silencing underlies the transgene silencing 
phenomenon in the germline of C. elegans and 
provided a simple solution to achieve transgene 
expression by avoiding piRNA recognition. 

Our study showed that many endogenous genes 
also contain piRNA targeting sites but exhibit 
resistance to piRNA silencing. Our analyses sug- 
gested PATCs to be a licensing signal protecting 
endogenous genes from piRNA silencing. How 
PATCs counteract piRNA silencing remains
unknown. A recent study showed that PATCs are
enriched in germline genes within repressive
chromatin domains, suggesting that PATCs may
prevent piRNAs from establishing heterochromatin 
at their target (33). Our data suggest that PATCs 
function not simply by promoting euchromatin 
formation but also by inhibiting the production 
of 22G-RNA at piRNA targeting sites (Fig. 4, D 
and G). If so, this would suggest that the formation of
heterochromatin may feed back to promote the
production of 22G-RNAs. Such a relationship
between chromatin and small-RNA production 
is reminiscent of RNA-induced transcriptional 
silencing described in Schizosaccharomyces pombe, 
in which small-RNA-guided heterochromatin re- 
cruits RdRPs to produce more small RNAs (39). 
In addition, because our data suggested that some 
endogenous genes—such as nop-1, cdk-1, or oma-1— 
exhibit resistance to piRNA silencing despite low 
PATC density (Fig. 4B) (33), other mechanisms 
may exist to license self genes for expression. 
Taken together, our studies revealed a strategy 
by which C. elegans defends its genome against 
foreign nucleic acids, whereby diverse piRNAs
silence foreign genes that are not licensed for
expression.

COLON CANCER


Patients with familial adenomatous 
polyposis harbor colonic biofilms 
containing tumorigenic bacteria 


Christine M. Dejea, Payam Fathi, John M. Craig, Annemarie Boleij,
Rahwa Taddese, Abby L. Geis, Xinqun Wu, Christina E. DeStefano Shields,
Elizabeth M. Hechenbleikner, David L. Huso, Robert A. Anders,
Francis M. Giardiello, Elizabeth C. Wick, Hao Wang, Shaoguang Wu,
Drew M. Pardoll, Franck Housseau, Cynthia L. Sears


Individuals with sporadic colorectal cancer (CRC) frequently harbor abnormalities in the 
composition of the gut microbiome; however, the microbiota associated with precancerous 
lesions in hereditary CRC remains largely unknown. We studied colonic mucosa of patients 
with familial adenomatous polyposis (FAP), who develop benign precursor lesions (polyps) 
early in life. We identified patchy bacterial biofilms composed predominately of Escherichia 
coli and Bacteroides fragilis. Genes for colibactin (clbB) and Bacteroides fragilis toxin (bft), 
encoding secreted oncotoxins, were highly enriched in FAP patients’ colonic mucosa compared 
to healthy individuals. Tumor-prone mice cocolonized with E. coli (expressing colibactin), 
and enterotoxigenic B. fragilis showed increased interleukin-17 in the colon and DNA 
damage in colonic epithelium with faster tumor onset and greater mortality, compared to 
mice with either bacterial strain alone. These data suggest an unexpected link between 
early neoplasia of the colon and tumorigenic bacteria. 


Colorectal cancer (CRC) is very common
globally and develops through accumulation
of colonic epithelial cell (CEC) mutations
that promote transition of normal
mucosa to adenocarcinoma. Around 5%
of CRC occurs in individuals with an inherited
mutation (1). One hereditary condition, familial
adenomatous polyposis (FAP), is caused by
germline mutation in the APC tumor suppres- 
sor gene. Individuals with FAP are born with 
their first mutation in the transition to CRC, 
and as somatic mutations accumulate, devel- 
op hundreds to thousands of colorectal polyps. 
The onset and frequency of polyp formation 
within families bearing the same APC gene 
mutation varies substantially (2), suggesting 
that additional factors contribute to disease
onset, including components of the microbiome (3).

The colon contains trillions of bacteria that are 
separated from the colonic epithelium by a dense 
mucus layer. This mucus layer promotes tolerance 
to foreign antigens by limiting bacterial-epithelial 
cell contact and, thus, mucosal inflammatory responses.
In contrast, bacterial breaches into the
colonic mucus layer, in some cases with biofilm
formation, foster chronic mucosal inflammation (4-6).

We previously reported that biofilms on nor- 
mal mucosa of sporadic CRC patients were asso- 
ciated with a pro-oncogenic state (6, 7), suggesting 
that biofilm formation is an important epithe- 
lial event influencing CRC. To test the hypoth- 
esis that biofilm formation may be an early event 
in the progression of hereditary colon cancer, we 
examined the mucosa of FAP patients at clin- 
ically indicated colectomy. 

We initially screened surgically resected tis- 
sue preserved in Carnoy’s fixative from five pa- 
tients with FAP and one with juvenile polyposis 
syndrome (table S1). Colon biopsies from indi- 
viduals undergoing screening colonoscopy or 
surgical resections served as controls (n = 20,
table S2). Polyps and macroscopically normal 
tissue were labeled with a panbacterial 16S ribo- 
somal RNA (rRNA) fluorescence in situ hybrid- 
ization (FISH) probe. Each FAP patient exhibited 
bacterial invasion through the mucus layer scat- 
tered along the colonic axis (Fig. 1A, table S3, 
and fig. S1). Unlike the continuous mucosal bio- 
films in sporadic CRC patients (6), FAP tissue 
displayed patchy bacterial mucus invasion (average
length, 150 μm) on ~70% of the surgically
resected colon specimens collected from four of
six hereditary tumor patients. Biofilms were not
restricted to polyps, nor did they display right
colon geographic preference as observed in spo- 
radic CRC (table S3 and figs. S1 and S2). Bio- 
films were not detected in the mucus layer of 
the FAP patient who received oral antibiotics 
24 hours before surgery (table S1 and fig. S2). 

Specimens with bacterial biofilms were fur- 
ther screened by additional 16S rRNA probes to 
recognize the major phyla detected in biofilms 
of sporadic CRC; namely, Bacteroides/Prevotella, 
Proteobacteria, Lachnospiraceae, and Fusobacteria 
(table S4). Notably, FAP biofilms were composed 
predominantly of mucus-invasive Proteobacteria 
(~60 to 70%) and Bacteroides (10 to 32%) 
(table S3). Fusobacteria were not detected, and 
Lachnospiraceae were rare (<3%) by quantita- 
tive FISH analysis (table S3). 

Additional probe sets (table S4) identified the 
predominant biofilm members as E. coli and
B. fragilis (Fig. 1A, bottom panels; table S3). 
Invasion of the epithelial cell layer by biofilm 
community members was detected in all patients 
harboring biofilms (Fig. 1B), a finding similar 
to that in sporadic CRC patients. Further, FISH 
of mucosal biopsies from ileal pouches or ano- 
rectal remnants of additional, longitudinally 
followed, postcolectomy FAP patients revealed 
biofilms in 36% and mucosal-associated E. coli 
or B. fragilis in 50% (table S5). Thus, E. coli and 
B. fragilis are frequent, persistent mucosal colo- 
nizers of the FAP gastrointestinal tract. Of note, 
semiquantitative colon mucosa bacterial cultures
of Apc^MinΔ716/+ mice (truncation at
codon 716 of Apc), a murine correlate of FAP,
displayed similar enrichment of Bacteroides
and Enterobacteriaceae compared to wild-type
(WT) littermates, consistent with data reported
for Apc^MinΔ850/+ mice (fig. S3) (8). These results
suggest that Apc mutations enhance mucosal bac- 
terial adherence, altering the bacterial-host epi- 
thelial interaction. 

Strong experimental evidence supports
the carcinogenic potential of molecular subtypes
of both E. coli and B. fragilis (9, 10), the
two dominant biofilm members identified in direct
contact with host colon epithelial cells in our
FAP patients. E. coli containing the polyketide
synthase (pks) genotoxic island (which encodes the
genes responsible for synthesis of the colibactin
genotoxin) induces DNA damage in vitro and
in vivo along with colon tumorigenesis in azoxymethane
(AOM)-treated interleukin-10 (IL-10)-deficient
mice (10), whereas enterotoxigenic
Bacteroides fragilis (ETBF) induces colon tumorigenesis
in Apc^Min/+ mice (9). Human epidemiological
studies have associated ETBF and pks+
E. coli with inflammatory bowel disease and 
sporadic CRC (10-13). Thus, we cultured banked 
frozen mucosal tissues from 25 FAP patients (two 
polyps and two normal tissues per patient when 
available, table S1) and 23 healthy individuals 
(mucosal sample from surgical resection or one 
ascending and one descending colon biopsy 
per colonoscopy subject, table S2) for the pres- 
ence of pks+ E. coli and ETBF. The mucosa of 
FAP patients was significantly associated with
pks+ E. coli (68%) and ETBF (60%) compared
to healthy subject mucosa (22% pks+ E. coli and
30% ETBF) (Fig. 1C). There was no preferential 
association of ETBF or pks+ E. coli with polyp 
or normal mucosa from FAP patients. Typically, 
mucosal samples from individual patients were 
concordant for pks+ E. coli or ETBF (73% for 
pks+ E. coli, 59% for ETBF), similar to results for 
mucosal bft detection in sporadic CRC patients
(13). Notably, pks+ E. coli and ETBF mucosal co- 
association occurred at a higher rate (52%) than 
expected to occur randomly (40.8%) given the 
frequencies for the individual species (Fig. 1C). 
Increased mucosal coassociation also occurred 
in healthy control subjects (22% observed versus 
6.6% expected) (Fig. 1C). Laser capture micro- 
dissection of mucosal biofilms from our initial 
FAP patients (fig. S2 and table S1) contained both 
bft and clbB as determined by polymerase chain 
reaction (PCR) analysis, indicating that the carcinogenic
subtypes of B. fragilis and E. coli, respectively,
were present in the mucus layer in
direct contact with the FAP epithelium (Fig. 1D). 
In contrast, neither virulence gene was detected 
in the mucus layer of control subject 3760 whereas 
bft was detected in the mucus layer of control 
subject 3730, consistent with our prior reported 
culture analysis of this sample (Fig. 1D) (13). 
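The "expected" coassociation rates quoted above follow from treating carriage of the two organisms as independent, so the expected joint frequency is the product of the individual frequencies. A short sketch of that arithmetic, using the counts reported in Fig. 1C (the function name is illustrative):

```python
# Expected co-occurrence under independence: P(A and B) = P(A) * P(B).
def expected_cooccurrence(n_a, n_b, n_total):
    """Expected fraction carrying both organisms if carriage were independent."""
    return (n_a / n_total) * (n_b / n_total)


if __name__ == "__main__":
    # Counts from Fig. 1C: FAP (n = 25), controls (n = 23).
    fap_expected = expected_cooccurrence(17, 15, 25)
    control_expected = expected_cooccurrence(5, 7, 23)
    print(f"FAP: expected {fap_expected:.1%}, observed {13 / 25:.1%}")
    print(f"Controls: expected {control_expected:.1%}, observed {5 / 23:.1%}")
```

With these counts the products come to 40.8% for FAP patients and about 6.6% for controls, matching the values given in the text.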
The high frequency of pks+ E. coli and ETBF 
cocolonization in FAP colons highlights the im- 
portance of understanding the potential effects 
of simultaneously harboring these two carcinogenic
bacteria. Consequently, we used two murine
models, AOM treatment without dextran
sodium sulfate (see materials and methods) and
Apc^MinΔ716/+ mice, to test the hypothesis that pks+
E. coli and ETBF cocolonization enhances colon
tumorigenesis compared to monocolonization
with either bacterium. The rate of spontaneous
colon tumorigenesis is very low in both model
systems.

[Fig. 1C, table]
           Total   pks+ E. coli   ETBF       pks+ E. coli and ETBF   Neither
FAP        25      17 (68%)       15 (60%)   13 (52%)                6 (24%)
Controls   23      5 (22%)        7 (30%)    5 (22%)                 16 (70%)
P value            0.002          0.049      0.0400                  0.0030

Specific pathogen-free wild-type mice were 
treated with the carcinogen AOM and monoin- 
oculated or coinoculated with canonical strains 
of pks+ E. coli (the murine adherent and inva- 
sive strain, NC101) and ETBF (strain 086-5443-2-2) 
(9, 10). Fecal ETBF or pks+ E. coli colonization 
was similar under monocolonization or cocolo- 
nization conditions, persisting until colon tumor 
formation was assessed at 15 weeks after colo- 
nization (fig. S4). Monocolonized (pks+ E. coli or 
ETBF) mice displayed few to no tumors. However, 
pronounced tumor induction occurred in co- 
colonized mice, including an invasive cancer, 
suggesting the requirement for both bacteria 
to yield oncogenesis (Fig. 2, A to C). Tumori- 
genesis required the presence of BFT and the 
colibactin genotoxin as in-frame deletions of 


3775 3975 3995 3971 3760 3730 NC101 ETBF 


: ae =a] 
—_) | iw = <5rT 
---- — €cibB 
| 


Fig. 1. Fluorescent in situ hybridization (FISH) and microbiology 
culture analysis of FAP mucosal tissues. (A) Top panels: Representative 
FISH images of bacterial biofilms (red) on the mucosal surface of a 

FAP polyp and paired normal tissues counterstained with DAPI (4’,6- 
diamidino-2-phenylindole) nuclear stain (blue). Middle panels: Most of 
the biofilm composition was identified as B. fragilis (green) and E. coli (red) 
by using species-specific probes. Bottom panels: PAS (periodic acid— 
Schiff)—stained histopathology images of polyp and paired normal 
mucosal tissues demonstrating the presence of the mucus layer. Images 
were obtained at 40x magnification; scale bars, 50 tum. Dotted lines 
delineate the luminal edge of the colonic epithelial cells. Images are 
representative of n = 4 to 23 tissue samples per patient screened (at least 
10 5-tum sections screened per patient). (B) Enterobacteriaceae (yellow) 
and E. coli (red) FISH probes on paired normal FAP tissue (100x 
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magnification) revealing invasion into the epithelial cell layer at the base 
of a crypt (arrows). Bottom panels with insets of Enterobacteriaceae (bottom 
eft panel) in yellow, E. coli (bottom middle panel) in red, and overlay 
(bottom right panel) confirming identification of the invasive species. Scale 
bar, 20 um. Images are representative of n = 5 to 16 tissue samples per 
patient screened (at least 10 5-tum sections screened per patient). (©) FAP 
and control prevalence of pks+ E. coli and enterotoxigenic Bacteroides 
fragilis (ETBF). Chi-square P-values are shown that represent the difference 
in probability of detection of each bacterium in FAP versus control 
patients. (D) PCR detection of c/bB (a gene in the pks island) and bft within 
aser-captured biofilms containing E. coli and B. fragilis from designated 
FAP patients (table S1) and controls (table S2; materials and methods). 
Data show a representative image from two independent experiments with 
two or three replicates per experiment performed. 
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Fig. 2. Cocolonization by pks+ E. coli and ETBF increases colon tumor 
onset and mortality in murine models of CRC. (A) Total colon tumor 
numbers detected in sham (n = 9), ETBF monocolonized (n = 12), pks+ 
E. coli monocolonized (n = 11), pks+ E. coli/ETBF cocolonized (n = 13), 

E. coliAoks/ETBF (n = 9), or pks+ E. coli/ETBFAbft (n = 10) AOM mice 

at 15 weeks after colonization. Data indicate mean + SEM. Overall significance 
was calculated with the Kruskal-Wallis test, and the overall P value is 
shown; Mann-Whitney U was used for two-group comparisons; **P = 
0.016, ****P < 0.0001. (B) Representative colons of monocolonized (ETBF 
or pks+ E. coli), cocolonized (ETBF/pks+ E. coli), E. coliApks/ETBF, and 
pks+ E. coli/ETBFAbft mice at 15 weeks after colonization of AOM-treated 
mice. Images are representative of n = 9 to 13 mice for each group. 
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(C) H&E (hematoxylin and eosin) histopathology of an invasive adeno- 
carcinoma in a cocolonized (pks+ E. coli/ETBF) AOM mouse at 15 weeks. 
Main image, 10x magnification; scale bar, 1 mm. Inset image, 100x 
magnification; scale bar, 0.2 mm. Blue arrow depicts the disruption of 
the muscularis propria by the invasive adenocarcinoma, and white 
arrows (inset) identify invading clusters of adenocarcinoma epithelial 
cells. (D) Kaplan-Meir survival plot of Apc476“""“* mice (n = 30) 
colonized with either ETBF (blue; n = 10), pks+ E. coli (orange; n = 10), or 
cocolonized with pks+ E. coli and ETBF (purple; n = 10). Cocolonization 
significantly (P < 0.0001) increased the mortality rate. Statistics were 
analyzed with the log-rank test. All surviving mice (n = 19) were harvested 
at 110 days. 


the bft gene and the pks virulence island signif- 
icantly decreased tumors (Fig. 2A). 
Apc™479/*mice cocolonized with ETBF and 
pks+ E. coli exhibited enhanced morbidity with 
rapid weight loss and significantly increased 
mortality (P < 0.0001) [loss of 80% of the mice 
(n = 8) by 8 weeks and the remaining 20% (n = 2) 
at 12 weeks after colonization]. In contrast, 90% 
(n = 9) and 100% (n = 10) of mice monocolonized 
with ETBF or pks+ E. coli, respectively, survived 
15 weeks after colonization (Fig. 2D). The robust 
tumorigenesis of ETBF alone (at 15 weeks) and 
cocolonized mice (majority deceased by 8 weeks 
after colonization) was similar, whereas tumor 
numbers were significantly increased in the co- 
colonized cohort compared to pks+ E. coli alone 
(fig. S5). Notably, at early time points, inflammation
was increased in the cocolonized cohort
compared to either ETBF or pks+ E. coli alone
(fig. S5). Together these results suggest that the
significant increase in colon inflammation and
early tumorigenesis in the cocolonized mice contributed
to their earlier mortality in the Apc^MinΔ716/+
mouse model.
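The survival comparison in Fig. 2D rests on Kaplan-Meier estimates. The sketch below shows the estimator applied to the coarse numbers given in the text; treating "by 8 weeks" as a single event time at week 8 is a simplifying assumption, so the output is a step summary of the reported counts and not a reconstruction of the published curve.

```python
# Minimal Kaplan-Meier estimator; event times below are coarse assumptions.
def kaplan_meier(times, events):
    """Return [(time, survival)] at each distinct time with an observed death.

    times: follow-up time for each animal (weeks).
    events: 1 if the animal died at that time, 0 if censored.
    """
    event_times = sorted({t for t, e in zip(times, events) if e})
    survival = 1.0
    curve = []
    for t in event_times:
        at_risk = sum(1 for x in times if x >= t)
        deaths = sum(1 for x, e in zip(times, events) if x == t and e)
        survival *= 1.0 - deaths / at_risk
        curve.append((t, survival))
    return curve


if __name__ == "__main__":
    # Cocolonized group (n = 10): 8 deaths assigned to week 8, 2 to week 12.
    coco_times = [8] * 8 + [12] * 2
    coco_events = [1] * 10
    # pks+ E. coli monocolonized group (n = 10): all alive at week 15 (censored).
    pks_times = [15] * 10
    pks_events = [0] * 10
    print(kaplan_meier(coco_times, coco_events))
    print(kaplan_meier(pks_times, pks_events))
```

A group with no deaths returns an empty list, meaning its estimated survival stays at 1.0 for the whole follow-up period.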

Consistent with enhanced tumorigenesis, his- 
topathological analysis revealed significantly 
increased colon hyperplasia and mucosal micro- 
adenomas in cocolonized AOM-treated mice 
compared to monocolonized mice (Fig. 3A and 
fig. S6A). However, histopathology scoring re- 
vealed modest differences in inflammation over 
time (4 days to 15 weeks) in mono- and cocolo- 
nized AOM mice (Fig. 3B and fig. S6B). Thus, 
overall inflammation did not seem to explain
differential tumor induction. To determine if the
type of inflammation contributed to differences 
in tumorigenesis, we analyzed lamina propria 
immune-cell infiltrates of monocolonized and 
cocolonized wild-type AOM mice by flow cytom- 
etry. Our general lymphoid panel revealed a 
marked B cell influx across all colonization 
groups (Fig. 3C) but no significant differences 
in the proportion of infiltrating T cells (CD4, 
CD8, or γδ T cells) and myeloid populations
between monocolonized and cocolonized AOM 
mice (Fig. 3C) either at the acute (1-week) or 
chronic (3-week) stage of infection. 

Of particular interest was IL-17, as the tumorigenic
potential of ETBF in Apc^MinΔ716/+ mice has
been attributed, in part, to IL-17 (9). Because bft
was necessary for synergistic tumor induction
under cocolonization conditions (Fig. 2A), we
tested the role of IL-17 in the cocolonized AOM
model. Although IL-17 expression analysis by
quantitative PCR revealed no significant difference
in overall mucosal IL-17 mRNA levels
between 15-week ETBF monocolonized and ETBF
and pks+ E. coli cocolonized mice (fig. S7), cocolonization
of IL-17-deficient AOM mice ablated
tumorigenesis (Fig. 3D). To specifically test
whether ETBF and pks+ E. coli cocolonization
affected early colon mucosal IL-17 production,
germ-free C57BL/6 mice were mono- or cocolonized
and innate and adaptive lymphocyte subsets
analyzed by flow cytometry. Germ-free mice
cocolonized with ETBF and pks+ E. coli displayed
a trend toward increase in total mucosal
IL-17-producing cells when compared to monocolonized
ETBF or pks+ E. coli mice, driven by
both adaptive [T helper 17 (TH17)] and innate
(particularly γδT17) cells (Fig. 3E and table S7).
Although necessary for tumorigenesis (Fig. 3D),
IL-17 alone appears insufficient to explain synergistic
tumorigenesis in cocolonized mice because
robust IL-17 induction by ETBF monocolonization
(fig. S7) induces only meager colon tumorigenesis
in AOM mice (Fig. 2A).

Fig. 3. IL-17-induced inflammation is necessary for bacterial-driven
tumorigenesis. (A) Histologic hyperplasia and (B) inflammation scores of
15-week AOM sham (n = 9), ETBF monocolonized (n = 12), pks+ E. coli
monocolonized (n = 11), or pks+ E. coli/ETBF cocolonized (n = 13) mice.
Data represent mean ± SEM of three independent experiments. For
(A) and (B), overall significance was calculated by using the Kruskal-Wallis
test, and the overall P value is shown; Mann-Whitney U was used for
two-group comparisons; **P = 0.01, ***P = 0.0014, ****P = 0.0006; NS,
not significant. (C) Myeloid and lymphoid lamina propria immune cell
infiltrates plotted as percentage of live cells in AOM mice at day 7 (top
panels) and day 21 (bottom panels) after colonization. Data represent
mean ± SEM of three independent experiments (total three to five mice
per group). (D) Total tumor numbers detected in IL-17-deficient AOM-treated
mice (IL17-/-) versus wild-type AOM mice (WT). Both mouse
strains were cocolonized with pks+ E. coli and ETBF and tumors assessed
at 15 weeks. Data represent mean ± SEM of two or three independent
experiments (total 6 to 13 mice per group). Significance calculated by the
Mann-Whitney U test represents differences between the non-normally
distributed colon tumors in the independent mouse groups. (E) IL-17-producing
cell subsets and total number of IL-17-producing (IL-17tot) cells
per colon harvested from germ-free C57BL/6 mice monocolonized with
pks+ E. coli or ETBF or cocolonized with pks+ E. coli and ETBF for up
to 60 hours. Data represent mean ± SEM of two independent experiments
(total 3 to 5 mice per group). Overall significance across IL-17-producing
cell types was calculated by using two-way analysis of variance testing
based on log-transformed data (bold P value). For each cell subset
and total number of IL-17-producing cells (gray dotted line box), the overall
P value is shown and was calculated by using the Kruskal-Wallis test.
Two-group cell subset and total number of IL-17-producing cell comparisons
were analyzed by Mann-Whitney U test and are reported in table S7.

Because our general lymphoid panel revealed
a marked B cell influx across all colonization 
groups (Fig. 3C), we profiled the secretory im- 
munoglobulin A (IgA) response by IgA enzyme- 
linked immunosorbent assay (ELISA) using stool 
collected 4 weeks after colonization from AOM 
mice. Cocolonized mice had a significantly more 
robust IgA response to pks+ E. coli than mice 
monocolonized with pks+ E. coli, whereas the 
fecal anti-ETBF IgA response was similar under 
mono- and cocolonization conditions (Fig. 4A). 
Thus, the increased fecal IgA response was specific
to pks+ E. coli in mice cocolonized with ETBF,
suggesting that cocolonization enhanced mucosal
exposure to pks+ E. coli.

Fig. 4. ETBF enhances pks+ E. coli colonization and colonic epithelial
cell DNA damage. (A) ELISA results showing anti-pks+ E. coli (NC101)
IgA and anti-ETBF (86-5443-2-2) IgA present in fecal supernatants from
wild-type AOM mice under the designated colonization conditions for 4 weeks.
Data represent mean ± SEM of three independent experiments (total
3 to 10 mice per group). (B) Colonization of distal colon mucosae by pks+
E. coli and ETBF under mono- and cocolonization conditions at 4 weeks
in AOM mice. Data represent mean of three independent experiments (total
of 15 mice per group). (C) Mucus depth (μm) of HT29-MTX-E12 monolayers
under the designated colonization conditions. Data represent mean ± SEM
of three independent experiments. A. muc, Akkermansia muciniphila.
(D) Representative images of γ-H2AX immunohistochemistry of distal colon
crypts from AOM mice (five mice per condition) mono- or cocolonized
with pks+ E. coli and ETBF for 4 days with quantification (right panel) of
γ-H2AX-positive cells displayed as percentage positive per crypt (see
materials and methods). Data represent mean ± SEM of three independent
experiments. For (A), (B), and (D), significance was calculated with the
Mann-Whitney U test for two-group comparisons; for (C), overall significance
was calculated with the Kruskal-Wallis test and the overall P value is shown;
Mann-Whitney U was used for two-group comparisons; ****P < 0.0001.

Although fecal colonization of both pks+ E. coli
and ETBF was equivalent under both mono- and
cocolonization conditions (fig. S4), quantification
of mucosal-adherent ETBF and pks+ E. coli
revealed a marked increase in mucosal-adherent
pks+ E. coli under cocolonization conditions compared
to pks+ E. coli monocolonization (Fig. 4B).
Hence, under monocolonization conditions,
pks+ E. coli is largely cultivatable only from the
colon lumen whereas in the presence of ETBF,
pks+ E. coli colonizes the mucosa at high levels
(10? to 10° colony-forming units per gram of
tissue). Using Muc-2-producing HT29-MTX-E12
monolayers in vitro, we tested the impact of
pks+ E. coli and ETBF on mucus. Although 
pks+ E. coli colonization alone had no impact 
on mucus depth, monolayer colonization with 
ETBF alone or cocolonized with pks+ E. coli sig- 
nificantly reduced mucus depth similar to colo- 
nization with A. muciniphila a known human 
colonic mucin-degrading bacterium (Fig. 4C). 
These results suggest that mucus degradation 
by ETBF promotes enhanced pks+ E. coli col- 
onization. Such a shift in the bacterial niche 
of pks+ E. coli would facilitate the delivery of 
colibactin, the DNA-damaging toxin released 
by pks+ E. coli, to colon epithelial cells. Con- 
sistent with this hypothesis, y-H2AX immuno- 
histochemistry revealed significantly enhanced 
DNA damage in the colon epithelial cells of AOM 
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mice cocolonized with pks+ E. coli and ETBF 
compared to monocolonized (pks+ E. coli or 
ETBF) mice (Fig. 4D). Further, mice cocolonized 
with ETBF and E. coliApks displayed similarly 
enhanced mucosal colonization with the E. coli 
strain (fig. S8) but reduced tumors and no in- 
crease in DNA damage or IL-17 (Fig. 2A and 
fig. S9, A and B, respectively). Lastly, persistent 
cocolonization of AOM-treated mice with the 
mucin-degrading A. muciniphila and pks+ E. coli 
did not enhance, but rather reduced, the modest 
colon tumorigenesis (fig. S10, A and B) induced 
by pks+ E. coli monocolonization. These results 
suggest that mucus degradation alone was in- 
sufficient to promote pks+ E. coli colon carcino- 


genesis in AOM mice. 
Taken together, these data suggest that cocolonization with ETBF and
pks+ E. coli, found in more than half of FAP patients (in contrast to
less than 25% of controls), promotes enhanced carcinogenesis through
two distinct but complementary steps: (i) mucus degradation enabling
increased pks+ E. coli adherence, inducing increased colonic
epithelial cell DNA damage by colibactin (Fig. 4D and fig. S9); and
(ii) IL-17 induction promoted, primarily, by ETBF with early
augmentation by pks+ E. coli cocolonization (Fig. 3, D and E, and
table S7). We propose that together these mechanisms yield
cooperative tumor induction in AOM mice cocolonized with ETBF and
pks+ E. coli.

ETBF and pks+ E. coli commonly colonize young children worldwide.
Thus, our results suggest that persistent cocolonization in the colon
mucosa from a young age may contribute to the pathogenesis of FAP and
potentially even of sporadic CRC, because APC loss or mutation occurs
in the vast majority of sporadic CRC. We note that pks+ E. coli are
phenotypically and genotypically adherent and invasive E. coli (AIEC)
(14). Despite this designation, derived primarily from in vitro cell
culture experiments, the canonical pks+ E. coli strain (NC101) used in
our experiments was only cultivatable from the colon lumen in the
absence of concomitant ETBF colonization in our mouse model. This
ETBF-dependent shift to marked mucosal pks+ E. coli colonization is
consistent with our observations that ETBF and pks+ E. coli cocolonize
FAP colon biofilms, where both bacteria invade and cocolonize the
mucus layer throughout the FAP colon. These findings suggest that
analysis of coexpression of bft and clbB may have value in general
screening and potential prevention of CRC.
component of culture media for most applications. Serum is often one
of the most abundant constituents of cell culture media (typically
2-15% by volume), so its quality can have significant impact on
culture success. FBS in particular is the most widely used media
supplement for the culture of mammalian cells due to its ready
availability, along with its rich supply of growth factors that
promote the development and expansion of cells in vitro. Serum also
acts as a buffer to the cell culture system against a variety of toxic
effects that can disrupt cell growth such as pH change, proteolytic
activity, or the presence of endotoxin.

into single use tubes for frozen storage can reduce the potential for
repeated exposure to contaminants. Aliquoting also helps to prevent
cloudiness that can arise from the breakdown of lipids and
lipid-associated macromolecules as a consequence of repeated
freeze-thaw cycles. Although the presence of the clotting protein
fibrin is sometimes blamed for serum cloudiness, serum is prepared
from the post-coagulation fraction of blood, and therefore would not
typically contain significant amounts of fibrin or other clotting
factors.

It's also useful to realize that lipid content contributing to
cloudiness in serum can vary significantly among species. For example
(and perhaps not surprisingly), porcine serum is higher in lipid
content than its bovine counterpart. So, serum from diverse biological
sources requires unique handling and additional processing steps at
the manufacturing facility in order to prepare the final product for
use in the lab. Suppliers that offer sera from multiple species will
therefore have differential protocols in place for processing raw
materials in order to provide a consistent, appropriate serum product
to the end user, regardless of lipid biology.


World Geography: understanding sourcing and countries of origin

Because serum is a biological product, it is subject to variations that arise from the health of the 
source animals, which is in turn influenced by dietary, environmental, and global disease factors. 
One of the most important attributes knowledgeable serum product users therefore seek is the 
country of origin, or source. 


General Biology: how to assess and handle serum in the lab

Anyone who has ever ordered serum for the 
lab knows that delivery timing is important, 
as bottles will typically arrive frozen from the 
supplier. This is because the activity of critical
macromolecular factors in serum is
best protected by storing serum at -5 to
-20°C, and maintaining serum in the frozen 
state until use will also reduce the incidence 
of events like the breakdown of lipids. Serum 
is thawed overnight at 2-8°C to prevent 
degradation; alternatively, it may be thawed 
at room temperature with periodic agita- 
tion to resuspend the contents. Thawed 
serum should be mixed thoroughly before 
being added to cell culture medium. Finally, 
temperatures above 37°C may compromise 
serum performance by accelerating the deg- 
radation of critical serum factors, and should 
be avoided in the lab. 


Although the incidence and impact of various species-specific pathogens such as bovine spongi- 
form encephalopathy (BSE) on serum are still being studied, the USDA maintains lists of coun- 
tries approved for importation of serum that are based in part on up-to-date pathogen incidence 
data. Serum consumers should therefore always expect suppliers to provide source information 
for their sera and other products of animal origin. 


Serum can be sensitive to environmental 
conditions beyond temperature. For exam- 
ple, high intensity/high energy wavelengths 
of light may degrade the activity of growth 
The USDA maintains a list of countries approved for importation of serum into the U.S. (guidelines revised annually). This
differs from U.S.-origin serum, which is collected from facilities in the U.S. that undergo continuous inspection by the USDA. 
International Regulatory Practices
Reputable suppliers will typically employ consensus testing methods specified by the serum
industry to ensure that serum supplied to laboratories meets standards for levels of common
contaminants that may impact its performance in cell culture. Some common tests are:

Contaminant/Parameter   Test                                                            Standard (if specified)
Endotoxin               Limulus amebocyte lysate (LAL)                                  Test and record
Bacteria/fungi          Broth or solid culture of a serum sample                        Not detected
Viruses (various)       PCR, anti-viral antibody screen, cytopathology, hemadsorption   Not detected
Mycoplasma              LookOut® kit, mycoplasma detection kits/reagents                Not detected
pH                      meter or standard laboratory pH measurement                     6.5 - 8.5 (rec: 7.0-8.0)


Standards for the testing of general serum properties and components have also been 
established to enable consistent evaluation for osmolality, total protein, hemoglobin, and 
general performance in culture applications. 


Limulus polyphemus, the Atlantic horseshoe 
crab, is particularly sensitive to the presence of 
endotoxins produced by the cell wall of Gram- 
negative bacteria that may also contaminate 
serum. Suppliers that follow industry guidelines 
will test for endotoxin in serum using the LAL 
(Limulus Amebocyte Lysate) assay. 


History: the importance of traceability and documentation in the serum industry 
While “origin” indicates the country in which serum was collected, “traceability” refers to how
well the journey of a serum product, from collection at site of origin through manufacturing
and distribution, has been documented.


Serum delivered frozen to a processing facility is thawed and tested for the presence of
bacteria, virus, endotoxin, and for hemoglobin content. The best quality serum is then filtered
through a sequence of membrane filters, terminating with a 0.1 µm pore filtration under
aseptic conditions. After filtration, aseptic procedure is maintained as serum is dispensed into
sterile bottles, quickly frozen to -20°C and quarantined until all lots have met or exceeded
quality control standards. A certificate of analysis for each serum lot should always be
available upon request from the supplier.


Traceability for biologics is critical to ensure that label descriptions are an accurate
representation of product history and contents. Complete documentation ensures that both
the customer and regulatory agencies can audit the processing of serum products, and
can uncover whether suppliers are engaging in serum pooling to offset local supplier cost
fluctuations, a practice which can compromise performance.


Economics 101: factors that may affect serum pricing

Serum is a commodity product for which price 
varies among countries of origin. For example, 
the cost of serum originating from Australia 
and New Zealand may be higher than that 
from other countries. The OIE’s current 
Geographical Risk Assessment rating for 
these countries is high because they continue 
to be free of BSE and FMD (foot and mouth 
disease), in part due to geographical isolation. 
This, coupled with strict agricultural regulatory 
practices, may increase demand for serum 
products from the ANZ region, which in turn 
can drive up price. 


The availability of FBS and other animal sera 
is inherently subject to fluctuations in global 
animal disease states, climate, and food and 
water supply. These factors make industry- 
wide serum pricing sensitive in ways that 
pricing for non-biological reagents is not. And 
last—but certainly not least—as cell culture 
becomes more prevalent, particularly in the 
pharmaceutical sector, serum pricing is sure 
to reflect the changes in supply this demand 
may precipitate. 


For additional information 
about serum for cell and tissue 
culture, please visit 
sigmaaldrich.com/serumFAQs 


To request a sample, go to 
sigmaaldrich.com/serumrequest 
LIFE SCIENCE TECHNOLOGIES


new products: tissue/cell culture 


Apoptotic Cell-Free DNA 
Amplification Kit 

The TruePrime apoptotic cell-free DNA amplification kit from
SYGNIS enables accurate DNA amplification using cell-free DNA
(cfDNA) obtained from plasma, serum, urine, cerebrospinal fluid,
and many other bodily fluids. It uses a novel multiple displacement
amplification method to overcome the common limitations with
analysis of cfDNA from bodily fluids. By exponentially amplifying
cfDNA derived from apoptosis (the cell-death mechanism), TruePrime
kits provide users with excellent sensitivity, error-free
amplification in high yields, and a streamlined workflow for reduced
hands-on time.
SYGNIS 

For info: +49-(0)-6221-3540-120 

www.sygnis.com 




Tissue Fixatives 

Poly Scientific R&D offers a line of specialty tissue fixatives. STU- 
IHC is a tissue preservative designed to give researchers gently 
fixed tissues that provide unimpeded access to epitopes for im- 
munohistochemical visualization. STUMol is designed for research- 
ers seeking maximum recovery of DNA, RNA, and proteins from 
source tissues; it is an excellent fixative to preserve tissue for cell 
harvest by laser-capture microdissection. STU-NGS is designed for 
those performing NGS studies of tumor tissue; yield and quantity 
of recovered DNA from STU-NGS fixed tissue exceeds that of for- 
malin. STU-ONE is an all-purpose nonformaldehyde tissue fixative; 
unlike formalin or paraformaldehyde, STU-ONE does not denature 
DNA, RNA, and proteins. Poly Scientific R&D will manufacture any 
stain or reagent to your specifications and ship directly to your 
laboratory, not to a distributor's shelf. 

Poly Scientific R&D 

For info: 800-645-5825 

www.polyrnd.com 


Flow Cytometer 

Designed and developed to remove common barriers associated 
with flow cytometry, the Attune NxT flow cytometer offers 
adaptable optical configuration options so you can get the most 
out of your multicolor analysis. The innovative combination
of acoustic focusing technology, coupled with traditional
hydrodynamic focusing, results in acquisition speeds up to 10X
faster than those of other flow cytometers. Attune NxT is ideal
for immunophenotyping and signaling studies, cell-cycle analysis,
detection of rare events, stem-cell analysis, cancer and apoptosis
studies, microbiological assays, and more. Its modular design 
offers the flexibility to meet your future requirements. Purchase 
what you need now and easily upgrade later. The instrument’s 
small footprint requires minimal bench space—it can even be 
used in a cell culture hood. 

Thermo Fisher Scientific 

For info: 800-955-6288 

www.thermofisher.com 


Mycoplasma Test Kit 

The EZ-PCR Mycoplasma Test Kit is a highly sensitive, specific PCR- 
based assay designed to detect a broad range of mycoplasma 
species in cell cultures and other biological materials. This kit 
contains an optimized, complete PCR mix, including a variety of 
mycoplasma-specific primers, deoxyribonucleotide triphosphate 
(dNTP) mix, and Taq polymerase. With the kit’s simple protocol
and ready-to-use format, samples can be prepared and run in 
about 10 minutes, with accurate results seen in only a few hours. 
Biological Industries 

For info: 860-316-2702 
www.bioind.com/ez-pcr-mycoplasma-test-kit 


Tissue Clearing System 

The X-CLARITY Tissue Clearing System is an all-in-one, easy-to- 
use solution for electrophoretic tissue clearing. Its unique design 
accelerates the removal of lipids from tissues while preserving 
the structural integrity of the sample. The X-CLARITY system can 
clear a whole mouse brain in just 6 hours, an astounding eight 
times faster than the original technology. Moreover, it success- 
fully produces transparent tissues that are efficiently penetrated 
by and labeled with macromolecules such as antibodies or oligo- 
nucleotides. This capability allows for 3D imaging of large tissues 
at single-cell resolution. 

Logos Biosystems 

For info: +82-(31)-478-4185 

www.logosbio.com 


Microplate for Microbial Research 

The novel shape of the FlowerPlate microplate ensures turbulent 
mixing for biological suspensions and broths when incubated and 
shaken on a BioLector or BioLector Pro system. The flower shape 
acts similarly to baffles in shake flasks, and increases mixing and 
gas/liquid mass transfer. In addition to the unusual well shape— 
which is proprietary to this plate—the underside features a clear 
polystyrene base through which real-time physical measurements 
can be made during the shaking and incubation cycle, giving valu- 
able kinetic information about these processes as they occur. The 
plates for online cultivations have a clear bottom and are available 
without optodes, with pH or dissolved oxygen (DO) optodes, or 
with both pH and DO optodes. The plates for offline cultivation 
are nontransparent. 

m2p-labs 

For info: 49-(0)-2401-80-53-30 
www.m2p-labs.com/bioreactors/microtiter-plates/flowerplate 
Stericup® Quick Release
Filtration Systems 
Work With Ease. Filter With Confidence. 


Stericup® Quick Release Filtration Systems streamline 
your workflow with ergonomic design updates and 
safeguard your results with the proven performance 
of Millipore® membranes. 
Funnel Removal
Frosted Writing Surface
Lighter Color for Legibility
Click-Seal Confidence Cap
PRECISION MEDICINE AND CANCER IMMUNOLOGY IN CHINA


Articles continued 


Precision cancer medicine and immunology 
in China 
Xu-Chao Zhang and Yi-Long Wu 


From big data to knowledge in precision 
medicine 
Dechao Bu, Shaoliang Peng, Haitao Luo et al. 


The role of circulating cell-free DNA in
the management of cancer in China
Ying Hu, Yanhui Chen, Lei Zhang et al.


Next-generation sequencing-based 
testing for cancer precision medicine 
in China: A review of technologies 
and validation procedures 

Weifeng Wang, Weiwei Shi, Ming Yao et al. 


ctDNA-NGS: The key to unlocking a 
molecular diagnostic revolution in the 
heart of Asia 

Ying Hou and Kang Ying 


Adoptive cell transfer therapy: A 
strategic rethinking of combination 
cancer therapy 

Minghui Zhang 


Foundational biomarkers for precision medicine




INTRODUCTIONS 


One might argue that the concept of personalized medicine, in
which a treatment is tailored for a specific individual based on
their unique physiology as well as their specific disease and
drug tolerance, is a foundational aspect of the ancient art of
traditional Chinese medicine, practiced for centuries. Today, China’s
medical practitioners depend less on ancient remedies and more on
evidence-based practice. They bring with them an appreciation for
the benefits and rationale of personalizing treatments to each patient.
Therefore, to them, the shift from generalized therapies to precision 
medicine is perhaps an easier and more logical one to make than for 
those trained in westernized settings. 

In order for precision medicine—the term that now appears to have 
dislodged “personalized medicine”—to be successful, accurate characteri- 
zation of the patient is necessary. Various biomarkers provide the nec- 
essary data, collected through a variety of ‘omics techniques including 
next-generation DNA sequencing (genomics), analysis of protein levels in 
blood or tissues (proteomics), or determination of RNA levels (transcrip- 
tomics). However, identifying and characterizing biomarkers that accu- 
rately reflect a physiological state (normal or diseased), or response to a 
particular drug or therapy, has turned out to be challenging. Add to this 
the complication that biomarkers may differ between population groups, 
or indeed between individuals, and that tracking these biomarkers as 
the patient's status changes can be onerous, and the future of precision 
medicine could be described as bleak. 

This pessimistic outlook has not stopped researchers from pushing for- 
ward in their search for accurate and robust biomarkers. Big data analysis 
is helping, by providing a means to crunch millions of datapoints to yield 
associations that are not at first obvious. It is hoped that these associations 
will point to the presence of predictive biomarkers or potential targets for 
therapy, and also help to predict the risk of disease, ascertain the proba- 
bility of positive clinical outcomes, and evaluate therapeutic efficacy. Such 
biomarkers are also the ultimate goal of many next-generation sequencing 
studies being performed on a range of samples, including tumor tissue 
and circulating cell-free DNA. Applications of this technology in the clinic 
are bringing researchers closer to real-time biomarker tracking, with 
implications for cancer detection and the development of safe, effective 
treatments. 

New immunotherapy treatment modalities, such as the use of checkpoint 
inhibitors, cytokines, and chimeric antigen receptors, are being developed 
at an increasingly rapid pace, and the success of such therapies depends 
heavily on extensive knowledge of individual patients, for which high-qual- 
ity biomarkers are especially important. 

The articles presented in this booklet cover many of the topics above, 
with a focus on precision medicine research currently being performed in 
China. Researchers there are determined to overcome every obstacle to 
detecting and exploiting genomic and proteomic biomarkers in a clinical 
setting for the benefit of their patients. They also hope their insights 
will advance the practice of precision medicine both domestically and 
worldwide. 


Sean Sanders, Ph.D. 
Jackie Oberst, Ph.D. 
Science/AAAS Custom Publishing Office 


A global 
rooted in
BeiGene, Ltd. (NASDAQ: BGNE) is a global, commercial-stage
biotechnology company focused on molecularly targeted
and immuno-oncology cancer therapies. With a team of over
800 employees in China, the United States, and Australia,
BeiGene is advancing a pipeline of novel small molecules, monoclonal
antibodies, and combination therapies for cancer treatment. BeiGene also
markets ABRAXANE (nanoparticle albumin-bound paclitaxel), REVLIMID
(lenalidomide), and VIDAZA (azacitidine) in China under a license from
Celgene Corporation.

BeiGene was founded in 2010 based on the premise that the confluence
of two major developments (the revolutionary scientific breakthroughs
in cancer medicine, and the emergence of the pharmaceutical market in
China, where nearly a quarter of the world’s cancer population has limited
access to innovative therapies) may allow new biotech leaders to emerge.
With Beijing-based R&D, BeiGene recruits from China's strong scientific 
talent pool and has developed a drug discovery platform incorporating 
tumor samples through local hospital collaborations. Its scientific advisory 
board consists of world-renowned scientists and clinicians and is chaired 
by Dr. Xiaodong Wang, cofounder of BeiGene, founding director and 
architect of China's National Institute of Biological Sciences, and a member 
of the Chinese Academy of Sciences and the U.S. National Academy of 
Sciences. 

Over the past seven years, BeiGene has discovered and advanced 
into clinical development four investigational drug compounds: Bruton’s 
tyrosine kinase (BTK) inhibitor zanubrutinib (BGB-3111), PD-1 antibody 
tislelizumab (BGB-A317), PARP inhibitor pamiparib (BGB-290), and RAF 
dimer inhibitor lifirafenib (BGB-283). Zanubrutinib is in registrational trials 
both globally and in China, and its global registration program includes a 
phase 3 head-to-head trial comparing BGB-3111 to ibrutinib, a currently 
approved BTK inhibitor, with the aim of demonstrating superior depth 
of response. Tislelizumab is the subject of a strategic collaboration with 
Celgene and is in registrational trials in China. BeiGene is also testing 
tislelizumab in combination with pamiparib and zanubrutinib, respectively. 
The company plans to initiate additional registrational trials of its assets, 
both in China and globally, and to advance additional preclinical assets 
into the clinic. 

Building on its scientific roots and research foundation in China, BeiGene 
has established global clinical development capabilities with a significant 
presence in the United States, China, and Australia. In addition, the com- 
pany has domestic manufacturing capabilities, including a multipurpose 
manufacturing facility in Suzhou and a commercial-scale biologics manu- 
facturing facility under construction in Guangzhou, established through a 
joint venture with the Guangzhou Development District. Through its strate- 
gic collaboration with Celgene, BeiGene also recently acquired Celgene’s 
commercial operations in China and gained exclusive rights to commer- 
cialize Celgene’s three approved therapies there, which is expected to help 
BeiGene prepare for the potential future commercialization of its internally 
developed compounds and any additional in-licensed compounds in 
China. BeiGene aspires to be a global biotech leader and is committed to 
bringing new, potentially life-altering treatments to patients worldwide. 


Xiaodong Wang, Ph.D. 

Founder & Chairman of Scientific Advisory Board, BeiGene 
John V. Oyler 

Founder & CEO, BeiGene 


Certain statements found herein may constitute forward-looking statements that involve 
numerous risks and uncertainties that are described in BeiGene’s filings with the Securities 
and Exchange Commission, and are made only as of the date of this publication. 


Targeted therapy for liver cancer: Challenges and opportunities


Shuzhen Chen1, Jing Fu1,
and Hongyang Wang1,2


Liver cancer is the sixth most prevalent cancer
and the second leading cause of cancer-related
death worldwide (1). China alone accounts for over
half of the new cases and deaths. It is estimated that 
in 2015 alone, 466,100 new cases of liver cancer 
were diagnosed in China and 422,100 deaths oc- 
curred there (2). Of all the cancers, the survival rate 
of liver cancer is the poorest—the age-standardized 
five-year relative survival rate is only 10.1% (3). Due 
to difficulties in early diagnosis, most liver cancer 
patients are diagnosed at an advanced stage, losing 
the opportunity for curative treatments such as liver 
resection or ablative procedures. 

Fortunately, the development of innovative tech- 
nology such as next-generation DNA sequencing 
has enabled a rapid and dramatic increase in our 
understanding of the genetic, molecular, and mor- 
phological changes occurring in individual cancer 
patients, laying the foundation for the emergence 
of targeted therapy. Although targeted therapies 
such as sorafenib treatment have raised hope for 
advanced liver cancer patients, their clinical benefits 
remain modest at best (4, 5). It is hoped that target- 
ed therapy will provide functional and even structur- 
al corrections at the molecular level, or at least offer 
a valid alternative to conventional treatment. Howev- 
er, liver cancer is an extraordinarily heterogeneous 
disease, which makes it difficult to properly stratify 
patients for optimal targeted treatment and increas- 
es the risk of side effects, leading to the persistent 
failure of targeted therapy (6). In this review, we 
summarize the progress made in targeted therapy 
for liver cancer treatment in China and focus on the 
challenges and opportunities thereof. 


1International Co-operation Laboratory on Signal Transduction, Eastern Hepatobiliary
Surgery Institute, Second Military Medical University, Shanghai, China
2National Center for Liver Cancer, Shanghai, China


Sorafenib
Sorafenib is the first small-molecule targeted drug
that has demonstrated a survival benefit in advanced
hepatocellular carcinoma (HCC) patients (5, 7). It is
a multikinase inhibitor of several tyrosine protein
kinases, including vascular endothelial growth factor
receptor (VEGFR) and platelet-derived growth factor
receptor (PDGFR). Sorafenib can also target
intracellular serine and threonine kinase signaling such
as RAF proto-oncogene kinase, including the C-Raf
and B-Raf pathways (8-10). Sorafenib was approved
as the only standard systemic treatment for HCC
mainly on the basis of two studies: the Sorafenib
HCC Assessment Randomized Protocol (SHARP)
phase 3 trial (conducted in Europe, North America,
South America, and Australasia), and a phase 3
randomized trial conducted in the Asia-Pacific region.
According to these two studies, the sorafenib
treatment group showed prolonged median survival:
10.7 months in the SHARP study compared to 7.9
months for the placebo group (5), and 6.5 months
in the Asia-Pacific study compared to 4.2 months
for the placebo group (7). However, further analysis of
these two trials and results from other studies have
shown poor tolerability of sorafenib owing to
its severe adverse events, including gastrointestinal,
dermatologic, hematologic, cardiovascular,
and nervous system side effects (11-16), making
patients reluctant to continue the treatment.
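
The survival figures quoted above can be put side by side with a little arithmetic. The Python sketch below is only an illustration: the median survival values are those given in the text, while the dictionary layout and the function name `median_gain` are assumptions made for this example and do not come from either trial. A difference in medians is a rough summary and no substitute for the trials' own statistical analyses.

```python
# Median overall survival (months) as quoted in the text above.
TRIALS = {
    "SHARP": {"sorafenib": 10.7, "placebo": 7.9},
    "Asia-Pacific": {"sorafenib": 6.5, "placebo": 4.2},
}


def median_gain(sorafenib: float, placebo: float) -> tuple[float, float]:
    """Return (absolute gain in months, relative gain in percent).

    Hypothetical helper for illustration; both values are rounded
    to one decimal place.
    """
    absolute = sorafenib - placebo
    relative = 100.0 * absolute / placebo
    return round(absolute, 1), round(relative, 1)


if __name__ == "__main__":
    for name, arms in TRIALS.items():
        months, percent = median_gain(arms["sorafenib"], arms["placebo"])
        print(f"{name}: +{months} months ({percent}% over placebo)")
```

In both trials the absolute gain in median survival is under three months, which fits the earlier description of the clinical benefit of targeted therapy as modest.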

Drug resistance is another bottleneck issue for
sorafenib treatment. Numerous studies have
revealed significantly differing responses to sorafenib
due to tremendous variability in the way HCC
progresses (17, 18). Chinese researchers have made
serious efforts to decipher the mechanism underlying
resistance to sorafenib and to identify potential
biomarkers predictive of sorafenib response. A study
by our team demonstrated that HCC patients with
high 26S proteasome non-ATPase regulatory subunit
10 (PSMD10) expression had worse prognosis and
a poor response to sorafenib therapy. The overall
survival time of sorafenib-treated HCC patients
with high levels of PSMD10 was much shorter than
that of those with low PSMD10 (p=0.0099), with the
median survival time reduced by more than 40 months
(p=0.0099). These results suggest that PSMD10
may be a potential molecular marker in the
classification of HCC patients who may not respond
effectively to sorafenib (17). In another study, our clinical
investigation revealed that HCC patients with low
Src-homology 2 domain-containing phosphatase 2
(Shp2) expression benefited from sorafenib
administration after surgery. This study showed that Shp2
could promote liver cancer stem cell expansion
by augmenting β-catenin signaling and might be
a useful indicator when determining
chemotherapeutic strategies (18).
The varied cellular metabolic phenotypes of tumor
cells may also affect the efficacy of sorafenib.
Investigation of the metabolic characteristics of tumor
samples from 63 HCC cases showed huge variation
in lipid content and glucose uptake. This study found
that the rate-limiting enzyme acetyl-coenzyme A
carboxylase alpha (ACCα) enhanced glucose-derived
de novo fatty acid synthesis (FAS) and promoted
tumor cell survival under energy stress, which
contributed to the heterogeneity of metabolic patterns in
HCC. Inhibition of ACCα-driven FAS using a specific
inhibitor, orlistat, improved the efficacy of sorafenib
in xenograft-bearing mice, suggesting that
interfering with ACCα-driven FAS could sensitize HCC cells
to sorafenib (19). Another study has reported that
blocking interleukin-6/signal transducer and
activator of transcription 3 (STAT3)-mediated preferential
glucose uptake could sensitize liver tumor-initiating
cells to sorafenib treatment and enhance its
therapeutic efficacy in vivo (20). These findings suggest
that a combination of sorafenib and inhibitors of
certain metabolic pathways could be a promising
approach for some HCC patients.


EGFR inhibitors 

Epidermal growth factor receptor (EGFR) is over- 
expressed in 40%-70% of human HCCs, a factor that 
has been proven to be closely linked to the forma- 
tion and growth of tumors. But EGFR inhibitors have 
shown disappointing results in clinical trials with unselected 
patients (21). A study was conducted in Taipei 
to evaluate the efficacy and safety of vandetanib, 
an oral inhibitor of both VEGFR and EGFR, in pa- 
tients with inoperable advanced HCC. The study ob- 
served no significant difference in the rate of tumor 
stabilization or vascular change between the vande- 
tanib group and the placebo group, suggesting that 
vandetanib had limited clinical activity in HCC (22). 
Other clinical trials with erlotinib, gefitinib, or cetux- 
imab showed only limited effects in advanced stage 
HCC or modest effects at most in phase 2 trials (21). 
A better understanding of the mechanisms underly- 
ing how EGFR signaling influences HCC progression 
is therefore needed. 

A study of the role of EGFR in HCC formation 
showed that the absence of EGFR in macrophages 
impaired the development of HCC in mice, where- 
as mice lacking EGFR in hepatocytes unexpectedly 
developed more HCCs due to increased compen- 
satory proliferation after cell damage. Following 
inflammatory stimulation, EGFR induces interleukin-6 
expression in liver macrophages, triggering hepato- 
cyte proliferation and the development of HCC (23). 
This study demonstrated that EGFR has different 
roles in tumor cells than in nontumor cells, providing 
some explanation of the disappointing results 
of anti-EGFR agents in HCC treatment. Other recent 
research by our team indicates that levels of choline 
kinase alpha (CHKA) are higher in human HCCs than 
in nontumor tissues, and that CHKA is associated with 
tumor aggressiveness and reduced overall survival. 
Further study has revealed that CHKA could facilitate 
a functional interaction between EGFR and mammalian 
target of rapamycin complex 2 (mTORC2), which 
could contribute to HCC metastasis by promoting 
AKT (Ser473) activation. In this way, overexpression 
of CHKA promotes resistance to EGFR-targeted 
drugs (gefitinib and erlotinib) in HCC, suggesting 
that dual inhibition of CHKA and mTORC2 could be 
a way to overcome the resistance of HCC cells to 
EGFR-targeted therapies (24). 


Immunotherapy 


GPC3-based immunotherapy 

Glypican-3 (GPC3) can be detected in 72% of 
HCCs, but not in normal hepatocytes, 
cirrhotic liver, or benign liver lesions (25). In 
addition to being a marker for HCC, GPC3 plays a 
role in the progression of the disease. It activates 
Wnt signaling and stimulates cell cycle progression 
and cell survival (26), indicating that anti-GPC3 
therapy could be a therapeutic strategy for HCC 
treatment. The potential use of GPC3-derived 
antibodies or peptide vaccines has been explored in 
HCC immunotherapy (27-29). Disappointingly, in 
clinical trials these agents showed only limited curative 
effect (30). 

Chimeric antigen receptor T (CAR-T) cells have 
been heralded as a breakthrough technology due 
to the substantial benefits observed in patients 
with relapsed or refractory B-cell malignancies. 
More than 200 CAR-T cell clinical trials have been 
initiated so far, most of which use CD19-specific 
CARs aimed at treating lymphoma or leukemia (31). 
Researchers interested in HCC have mainly explored 
the possibility of redirecting T cells to recognize 
GPC3 for the treatment of HCC. T cells with CARs 
or high-affinity T-cell receptors (TCRs) targeting 
GPC3 were therefore engineered. Such targeted 
cells can efficiently recognize and destroy GPC3- 
positive human HCC cells in vitro and in vivo (32, 
33). In a recent study, Li and colleagues developed 
T cells carrying two complementary CARs, against 
both GPC3 and asialoglycoprotein receptor 1 
(ASGR1), to reduce the risk of on-target, off-tumor 
toxicity, while maintaining relatively potent antitumor 
activity (34). These preclinical studies suggested that 
adoptive transfer of GPC3-specific T cells presents 
a promising therapeutic strategy for treating HCC. 
Anti-GPC3 CAR-T therapy is now undergoing clinical 
trials in China. 

FIGURE 1. Schematic representation of the signaling network and agents involved in targeted therapy 
for liver cancer. Erlotinib and gefitinib are tyrosine kinase inhibitors targeting EGFR. The PI3K/AKT/ 
mTOR/p70S6 and Ras/Raf/MEK/ERK pathways are involved when EGFR signaling is activated, and can 
be blocked by independent inhibitors. Vandetanib and sorafenib are multikinase inhibitors of several 
tyrosine protein kinases such as VEGFR and PDGFR. GPC3 increases the binding of Wnt to FZD, which 
results in the stimulation of β-catenin transcriptional activity. Approaches to targeting GPC3 in HCC such 
as anti-GPC3 antibody or GPC3-targeted chimeric antigen receptor have shown some promise in different 
studies. EGFR, epidermal growth factor receptor; PDGFR, platelet-derived growth factor receptor; 
GPC3, glypican-3; Ab, antibody; VEGFR, vascular endothelial growth factor receptor; PI3K, 
phosphatidylinositol-3 kinase; mTOR, mammalian target of rapamycin; p70S6, ribosomal protein S6 kinase; MEK, 
MAPK/ERK kinase; MAPK, mitogen-activated protein kinase; ERK, extracellular signal-regulated kinase; 
DSH, dishevelled; FZD, frizzled; HIF-1α, hypoxia-inducible factor 1-alpha. 


Anti-PD-1/L1 

Programmed cell death 1 (PD-1) is an immune 
coinhibitory receptor expressed on immune cells 
such as T cells, B cells, and natural killer (NK) cells. 
PD-1 suppresses antigen-specific T-cell activation 
through interaction with its ligand, PD-L1, which has 
been observed to be upregulated in tumor cells 
(35). Clinical trials of antibodies targeting PD-1 or 
PD-L1 for the treatment of HCC have shown some 
promising results (36). A recent report in The Lancet 
evaluated the safety and efficacy of the PD-1 inhibitor 
nivolumab in patients with advanced HCC in 
an open-label, noncomparative, phase 1/2 dose 
escalation and expansion trial. The study showed 
that nivolumab produced durable objective responses 
and long-term survival rates in patients with 
advanced HCC (37). 

However, it is important to recognize that in 
previous studies the response rate to anti-PD-1 as a 
stand-alone therapy was 10%-30% overall, which 
included immunogenic tumors such as malignant 
melanoma that have a much higher response rate 
(36). One possible explanation for the low response 
rate might be the influence of molecules involved 
in immune escape other than PD-1 and PD-L1/2. 


Therapy that combines antibodies to block three 
inhibitory immune checkpoint molecules (PD-1, T cell 
immunoglobulin and mucin domain 3 [TIM-3], and 
lymphocyte-activation gene 3 [LAG-3]) has already 
been reported to restore the immune response to 
tumor antigens of HCC-derived T cells (38). However, 
there is still a long way to go before such combined 
antibody therapy can be established and tested in 
clinical trials. 


Potential targeted therapies 

Recent findings have also shed light on 
other potential targets for HCC treatment. The 
development of HCC is a multistep process 
with high intratumoral heterogeneity, including 
alterations in tumor microenvironment, signaling 
pathways, and energy metabolism patterns. In 
previous reports, it has been noted that adenosine 
monophosphate (AMP)-activated protein kinase 
(AMPK) serves as an energy sensor in eukaryotic 
cells and plays a role in linking metabolism and 
cancer development (39-41). Our recent 
research has demonstrated that activation of AMPK 
by metformin, the first-line medication for the 
treatment of type 2 diabetes, not only inhibited HCC 
cell growth in vivo, but also augmented the growth 
inhibition induced by the chemotherapy drug 
cisplatin in these cells (42). Another study observed 
an imbalance of gut microflora as well as intestinal 
inflammation in rats chronically treated with the 
carcinogen diethylnitrosamine. Modulation of gut 
microbiota by probiotics dramatically mitigated 
liver tumor growth and spread in vivo (43). These 
studies indicate that strategies informed by 
HCC heterogeneity may offer a new avenue for 
therapeutic intervention to treat the disease. 


Perspectives 

An improved understanding of the molecular 
pathways that drive development of HCC has led 
to the identification of various biomarkers and the 
evaluation of several agents specifically targeted 
to tumor cells with particular molecular features. 
However, clinical trials undertaken worldwide have 
documented only occasional positive responses to 
such treatments. To date, no single agent or single 
targeted therapy has been formally found to be a 
cure for HCC in clinical trials. An increasing number 
of studies have demonstrated that intratumoral 
heterogeneity in individual patients is a roadblock 
for HCC targeted therapy. Therefore, the efficacy of 
targeted therapy requires a thorough understanding 
of the tumor microenvironment, metabolism, 
and gut microbiota of an individual. Meanwhile, 
combined therapies may be more effective than 
the administration of a single agent. IL-6 and PD-L1 
blockade, or sorafenib combined with anti-PD-L1 
monoclonal antibody, have demonstrated better 
efficacy than a single inhibitor in mouse models (44, 
45). It is hoped that the establishment of combined 
therapies can offer a way to successfully manage 
HCC patients in the future. 


The challenges of radiation oncology in the era of precision medicine 


Ligang Xing and Jinming Yu* 


Radiotherapy is an essential treatment 
in the management of cancer. Seventy percent 
of all cancer patients will get radiotherapy as 
at least part of their treatment (1). Advances in 
physics, mathematics, computer science, electrical 
engineering, and radiobiology have significantly 
improved the safety, precision, and efficacy of 
radiotherapy, the concomitant control of tumor 
growth, and the probability of a cure for many 
cancer sufferers. In recent years, the sequencing of 
the human genome has paved the way for precision 
medicine, which aims to deliver “the right treatment 
to the right patient at the right time.” Although 
discoveries arising from studying the genome have 
affected the delivery of chemotherapy and targeted 
biological agents (2), they have yet to impact the 
clinical use of radiotherapy. In this new era of 
precision medicine, radiotherapy poses both great 
challenges and great opportunities for physicians 
and researchers. 


Progress in modern radiotherapy techniques 

New technologies have been the main driving 
force behind innovations in radiation oncology 
over the last two decades. Technological 
advances have been put to clinical use, leading 
to better localization of the radiation dose and 
less damage to healthy tissue. These methods 
include technologies such as three-dimensional 
conformal radiotherapy (3D-CRT), intensity- 
modulated radiotherapy (IMRT), 3D brachytherapy, 
stereotactic radiotherapy, image-guided and 
adaptive radiotherapy (IGRT and ART), and 
charged particle radiotherapy using protons and 
carbon ions. Using 3D-CRT and IMRT, we can 
make the radiation conform to the shape of the 
target volume, solving the problem of irradiating 
complex targets that lie close to critical healthy 
structures. 3D-CRT and IMRT are routinely and 
successfully applied for head and neck cancers, 
prostate cancer, and many other common cancers 
including those of the lung, liver, esophagus, and 
breast (3, 4). In the past 10 years, there has been 
rapid growth in clinical research on and application 
of stereotactic ablative body radiotherapy (SABR), 
also known as stereotactic body radiation therapy 
(SBRT), for cancers such as lung, liver, spine, 
pancreas, and prostate. For example, by accurately 
delivering high-dose radiation to the tumor, SBRT 
has emerged as the standard of care for medically 
inoperable stage I non-small cell lung cancer 
(NSCLC) and may even outperform surgery in 
operable patients (5, 6). 


Genomics for personalized radiotherapy 

From 3D-CRT and IMRT to SBRT, two factors 
limit the efficacy of radiotherapy: (1) defining the 
target, or differentiating between tumor and normal 
tissue; and (2) determining the appropriate total 
dose of radiation and its “fractions”—the number 
of separate treatments into which the total dose is 
divided. In clinical practice, radiotherapy dosages 
and fractions have been determined empirically, 
which has resulted in reasonable disease control 
and acceptable levels of toxicity. However, results 
suggest that current radiotherapy dosing protocols 
can be further optimized with a modern precision 
oncology approach, such as gene profiling to detect 
relevant biomarkers or biomarker signatures that 
would inform clinicians of a particular patient's 
sensitivity or resistance to radiotherapy. Existing 
tests that predict a patient's sensitivity to radiation 
can be grouped into three categories: those 
that determine intrinsic radiosensitivity, those 
that determine tumor oxygen levels, and those 
that determine a tumor's chance of growing 
(7). Unfortunately, none of these approaches is 
practical for clinical application. Radiotherapy is 
used in different settings depending on the site 
of the disease, so the clinical utility of a molecular 
biomarker signature indicating sensitivity to 
radiation would vary depending on the clinical 
application. The development of clinically relevant 
radiosensitivity molecular signatures is therefore 
challenging (8). 
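
The trade-off between total dose and the number of fractions described above can be made concrete with the linear-quadratic model, a standard radiobiology formalism that the text itself does not name. The sketch below is illustrative only: the function name, the two example schedules, and the α/β value of 10 Gy (a value commonly assumed for tumor tissue) are assumptions chosen for demonstration, not recommendations drawn from the article.

```python
def biologically_effective_dose(n_fractions, dose_per_fraction_gy, alpha_beta_gy):
    """Return the biologically effective dose (BED) in Gy.

    Linear-quadratic model: BED = n * d * (1 + d / (alpha/beta)),
    where n is the number of fractions and d the dose per fraction.
    """
    if n_fractions <= 0 or dose_per_fraction_gy <= 0 or alpha_beta_gy <= 0:
        raise ValueError("all inputs must be positive")
    total_dose_gy = n_fractions * dose_per_fraction_gy
    return total_dose_gy * (1 + dose_per_fraction_gy / alpha_beta_gy)


# Assumed alpha/beta ratio for tumor tissue; not taken from the article.
ALPHA_BETA_TUMOR_GY = 10.0

# Hypothetical schedules, for illustration only.
conventional = biologically_effective_dose(30, 2.0, ALPHA_BETA_TUMOR_GY)
hypofractionated = biologically_effective_dose(3, 18.0, ALPHA_BETA_TUMOR_GY)

print(f"30 x 2 Gy (total 60 Gy): BED = {conventional:.1f} Gy")
print(f"3 x 18 Gy (total 54 Gy): BED = {hypofractionated:.1f} Gy")
```

Under these assumptions, the schedule with the lower total dose yields the larger BED, which illustrates why total dose alone does not characterize a treatment schedule and why dose and fractionation have to be chosen together.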

Recently, Scott and colleagues identified 10 
genes that could index radiosensitivity (9). This 
could allow the radiation dose to be individually 
tuned to a tumor’s radiosensitivity and provide 
a framework for designing genomically guided 
clinical trials in radiation oncology. Being able to 
increase radiation dosage for more resistant tumors 
and lower it for more sensitive tumors would also 
lower the risk of complications from the therapy. 
It should be emphasized that tumor genomic data 
gives information only about a tumor’s intrinsic 
radiosensitivity. Additional biological insights about 
the tumor and its microenvironment, as well as 
information about the patient, are also important 
for optimizing radiotherapy dosing. 


Combining radiotherapy with targeted therapy 

Biomarkers that can predict tumor sensitivity to 
therapy are considered the gatekeepers necessary 
to develop precise and personalized medicine (10). 
These biomarkers are therapy-specific, and can 
therefore aid in therapeutic decision-making. For 
example, mutations in the epidermal growth factor 
receptor (EGFR) gene have been shown to predict 
the benefit derived from using tyrosine kinase 
inhibitors (TKIs), while anaplastic lymphoma kinase 
(ALK) gene rearrangements have been shown to 
predict the efficacy of ALK inhibitors in treating 
NSCLC (11, 12). Building on the preclinical rationale 
that inhibitors of EGFR function create a strong 
sensitivity to radiation (13), serial clinical trials have 
been conducted to test combinatorial treatments 
using EGFR inhibitors plus radiotherapy. In one 
pivotal phase 3 trial, adding the EGFR-targeting 
antibody cetuximab to radiation improved locoregional 
control in locally advanced head and neck 
squamous cell cancer and improved overall 
survival (14). However, phase 3 trials that evaluated 
cetuximab in combination with chemoradiotherapy 
for NSCLC and esophageal cancer all failed to 
improve overall survival in an unselected patient 
population (15, 16). 

Phase 1 and 2 trials of EGFR TKIs in combination 
with radiotherapy for locally advanced NSCLC or 
metastatic NSCLC have shown a favorable safety 
profile and some encouraging outcomes (17, 18). 
However, these trials were all performed in patients 
without information on whether their tumors 
carry the EGFR mutation, making the results less 
informative. These studies highlight the need for 
predictive biomarkers in cases where targeted 
therapy is combined with radiotherapy. 


Radiotherapy combined with modern immune-targeted therapy 

Understanding of the interaction between the 
immune system and tumor growth has led to the 
development of modern cancer immunotherapies. 
These include cancer vaccines, chimeric antigen 
receptor T-cell (CAR-T) therapy (in which immune 
system T cells are reengineered to act against 
a cancer), and immune checkpoint inhibitors, 
which interfere with proteins that prevent T cells 
from responding to cancer. Immune checkpoint 
inhibitors targeting several types of proteins, 
including cytotoxic T-lymphocyte antigen-4, 
programmed death-1 (PD-1), or programmed 
death ligand-1 (PD-L1), have demonstrated 
clinical efficacy against a broad spectrum of 
tumor types—a significant step for both science 
and medicine (19). Early studies revealed that 
radiotherapy could provoke an immune response 
not only at the irradiated site, but also at remote, 
nonirradiated tumor locations—the so-called 
“abscopal effect.” Cell death in the irradiated 
tumor can enhance antitumor immunity by 
inducing the expression of certain antigens on 
tumor cells and by activating lymphocytes to 
attack the tumor. Preclinical and clinical studies 
have demonstrated the efficacy and safety of 
radiotherapy combined with immunotherapy (20, 
21). Currently, clinical trials of such combined 
treatments for a variety of tumor types are 
underway. 

Most modern immunotherapies are not yet 
cost-effective, especially for patients in China, 
and immune checkpoint inhibitor therapy is not 
sufficiently precise yet. A crucial step in refining 
these therapies is the identification of biomarkers 
that can predict a tumor’s response to checkpoint 
blockade (22). The overexpression of the PD-L1 
antigen, the presence of tumor-infiltrating immune 
cells, or a variety of molecules in the tumor’s 
microenvironment may be important predictive 
biomarkers and are being extensively explored. 
However, they are not yet sufficiently predictive to 
allow them to be used to routinely stratify patients. 
Gene analysis is a new approach for judging the 
potential clinical benefit of checkpoint inhibitors 
(23), but further preclinical and clinical studies 
are necessary before it can be applied in clinical 
practice. In order to move the strategy successfully 
into the clinic, it is also critical to clarify the 
appropriate fractions and doses of radiotherapy 
and the suitable combinations of radiotherapy and 
immunotherapy (24). 


Molecular image-guided precision radiotherapy 

Imaging plays a critical role in precision medicine, 
from screening and early diagnosis to guiding 
treatment, evaluating responses to therapy, and 
assessing the likelihood of disease recurrence (25). 
Rapid advances in imaging technologies permit 
better anatomical resolution and provide noninva- 
sive measurements of functional and physiological 
properties of tissues and lesions at the molecular 
level. The development and application of molecular 
imaging techniques bring new opportunities 
for creating more precise treatment. 

Novel molecular imaging approaches are 
being developed and validated in many critical 
molecular pathways, such as glucose and amino 
acid metabolism, cell proliferation, hypoxia, 
angiogenesis, and receptor expression. The 
concept of “biological target volume” has been 
introduced as a factor when determining the 
intensity of radiotherapy needed for treatment (26). 
We are looking for the best way to use molecular 
imaging to guide radiotherapy for certain cancers, 
either to help define the target volume to be 
irradiated, or to aid in patient stratification. It was 
recently reported that, for a particular type of lung 
tumor, adapting the dose escalation to a mid-treatment 
positron emission tomography (PET) scan allowed 
clinicians to deliver higher-dose radiation to the 
more aggressive areas of the tumor and improve 
local control of tumor growth 
without increasing radiotherapy-induced lung 
toxicity (27). PET and computed tomography scans 
could also identify and delineate hypoxic areas 
that could be targeted for elevated dosing in lung 
cancer patients (28). 

The role of cancer imaging in precision 
medicine is being explored from another angle 
as “radiomics,” which assesses a large number of 
imaging features that characterize the observable 
properties of a tumor, using descriptors beyond 
simply its size to predict clinical outcomes with 
increased prognostic power, or even correlate with 
gene expression profiles (29). This approach could 
be important in helping to stratify patients who are 
at risk of disease recurrence (30, 31). 

In summary, precise radiation therapy is being 
explored at four different levels: (1) clinical 
features such as the molecular structure of the 
tissue, cancer stage, and tumor volume and 
location(s); (2) adaptive radiotherapy based 
on images collected during treatment; (3) 
biomarker-guided therapy; and (4) personalized 
radiotherapy delivery schedule (32). We believe 
that with multidisciplinary guidance, the strong 
support of science and technology, and an eye 
to cost-effectiveness, precision radiotherapy that 
incorporates radiobiology, bioinformatics, and 
molecular imaging will eventually be realized. 


The role of multidisciplinary efforts in precision medicine and immunology for clinical oncology 


Jianzhen Lin, Anqiang Wang, Junyu Long, Haohai Zhang, Yi Bai, 
Xiaobo Yang, Jie Pan, Ke Hu, Lin Zhao, 
Xinting Sang, and Haitao Zhao 


Cancer is a group of highly heterogeneous diseases 
in which dysregulation of key cellular processes 
leads to neoplastic transformation. Tumor immunity 
and suppression of the immune response within the 
tumor microenvironment support its progression. In 
the past few years, dramatic advances in DNA se- 
quencing technologies have facilitated key insights 
into the genomic alterations and somatic mutations 
that enable cancer formation and progression, allow- 
ing for significant advances in precision medicine. 
The broad lexicon of precision medicine describes 
the narrowing of medical care to the characteristics 
of an individual patient. It improves upon the current 
approach of stratifying patients into treatment groups 
based only on phenotypic biomarkers, and instead 
makes use of a patient's molecular information (in- 
cluding genomics and proteomics) to inform diagno- 
sis, prognosis, treatment, and disease prevention for 
that individual. 

Oncology is at the frontline of precision medicine. 
It is evolving from the previous model of administer- 
ing cancer therapeutics through a unified treatment 
regimen based on cursory tumor classification, to 
one that applies the precise molecular profile of an 
individual's cancer genome to optimize personalized 
treatment. Another new and highly successful thera- 
peutic approach to cancer, immunotherapy (Science's 
2013 Breakthrough of the Year), has created great 
excitement for clinicians, patients, researchers, and 
scientific journals. Current strategies in cancer immunotherapy 
include chimeric antigen receptor (CAR) 
T-cell therapy, immune checkpoint inhibitors, and 
cancer vaccines. These treatments have demonstra- 
ble clinical benefits and show great promise. How- 
ever, there are still some challenges that precision 
medicine and immuno-oncology must confront. 
For precision medicine the most pressing are: 


1. Tumor heterogeneity and molecular evolution 

Tumors exhibit tremendous genetic heterogene- 
ity, both among different kinds of tumors and within 
a single tumor. This creates phenotypic variation, 
posing a significant challenge to personalized cancer 
medicine. Intratumor heterogeneity increases the 
complexity of cancer prognosis and likely contrib- 
utes to tumor metastasis under therapeutic pressure. 
Exploring genomic alterations by time series analysis 
can identify the factors that drive a tumor’s evolution 
during treatment, which may identify the molecular 
targets of resistance or tumor progression. Rapid 
advances in technology for studying tumor heteroge- 
neity would help us understand the tumor genomic 
landscape. Among such technologies are liquid bi- 
opsy assays of circulating tumor DNA, detection and 
analysis of circulating cancer stem cells, and multire- 
gion next-generation sequencing. 


2. Drug resistance 

Understanding the clonal make-up of tumors, their 
molecular evolution, and their response and resis- 
tance to drugs poses perhaps the greatest challenge, 
not only to the application of traditional therapies, 
but also to personalized cancer medicine. To improve 
an individual's treatment response and clinical out- 
comes, clinicians are obliged to inspect the evolving 
nature of the cancer genome. Moreover, combina- 
tional cancer treatments that target complementary 
signaling pathways, or harness the immune system 
through immunotherapy, have promise to overcome 
resistance while improving efficacy. 


3. Shortage of agents that target specific genomic 
aberrations 

Precise targeted therapy, which is based on the 
genomic characteristics of an individual patient and 
the mutations identified in their particular cancer, 
is limited by a shortage of biochemical agents that 
target specific genomic aberrations. In clinical prac- 
tice, it is common to detect significant gene muta- 
tions in a cancer patient but to have no existing drug 
that can target those mutations. Or, as also happens, 
the targeted agent used has little antitumor efficacy. 
This may be a result of tumor heterogeneity (in which 
clones of resistant tumor cells survive and proliferate) 
and the variability in the drug's effect on the targeted 
genes, or the presence of nonidentical molecular 
pathways in different patients due to inherent physio- 
logical and biochemical differences. 

For cancer immunotherapy treatments, the most 
pressing challenges are: 


1. Lack of a definitive response to treatment 

In many immunotherapy trials, durable responses 
and extended, long-term survival are seen only in a 
subset of patients, and it has been reported in some 
clinical cases that immunotherapy treatment could 
even promote cancer progression. These challenges 
highlight the importance of identifying distinguishing 
clinical and molecular characteristics that can ex- 
plain the differential response. Clues to what factors 
might predict a patient's response to therapy must be 
sought, and clinicians must be cognizant of overpre- 
scription in refractory patients. 


2. Identification of suitable patients 

The limitations of immunotherapy approaches have 
aroused researchers’ attention. CAR-T therapy, for 
example, has had clinical success against lymphocytic 
malignancies, but not against more common solid 
tumors. Even though antibodies that modify immune 
function can help patients defend against tumors, 
overall response rates are still relatively low, ranging 
from 10% to 30%. Thus, precisely identifying which 
patients are likely to benefit from immunotherapy 
is an important issue to address. Identifying unique 
clinical, genomic, and molecular patient character- 
istics will help researchers identify those factors that 
might predict response, and presents an opportunity 
to heighten response in refractory patients. Biomark- 
ers are crucial to identifying suitable patients, but the 
complexity of the immune system makes their devel- 
opment a challenge. 


3. Improving efficacy using combinational therapies 

There is considerable current interest in combining 
treatment modalities to improve the effectiveness 
of immunotherapies in a broad array of patients. 
This includes immunotherapies used in conjunction 
with chemotherapy, radiotherapy, targeted therapy, 
and even other immunotherapies. But caution 
is recommended. Our understanding of biological 
mechanisms leads us to expect additive or synergistic 
responses from such combined treatments. However, 
the complexity of the immune response could lead 
to unexpected, even serious, adverse events. Crucially, 
immunomodulatory therapies carry distinct risks, 
including autoimmune reactions that can be fatal if 
doctors are not alert and ready with proper treatment. 


To overcome these tough challenges, precision 
medicine and immunology for clinical oncology 
require multidisciplinary efforts. Large-scale 
genomic data needs to be integrated with clinical 
data, analyzed, and translated into information that 
can guide clinical decisions. Multidisciplinary efforts 
can also ensure the rational application of cancer 
immunotherapy and combinational therapies by 
using multi-omics data to improve the diagnosis, 
prognosis, and treatment of cancer. 

The Department of Liver Surgery and the Center 
for Translational Medicine at Peking Union Medical 
College Hospital have been focused for several years 
on researching precision tumor medicine and im- 
munotherapy and their translation to clinical use. As 
personalized cancer medicine moves to the clinic, 
conceiving of cancer as a systemic, highly heteroge- 
neous and complex disease becomes even more apt. 
Quality cancer care demands that we form a multidis- 
ciplinary team (MDT) of highly qualified health care 
professionals, with medical oncologists at its core. 

We see the characteristics of our MDT as including: 


1. A multidisciplinary approach 

Our MDT members come from specialties includ- 
ing hepatobiliary surgery, radiology, radiotherapy, 
medical oncology, and pathology. Also on the team 
are experts in cancer biology and bioinformatics who 
can help interpret information on genomic aberra- 
tions. We strive to translate dispersed knowledge into 
an integrated, coherent, and personalized treatment 
regimen. 


2. A patient-centric model 

Patients’ perceptions of their care are created by 
the quality of the care, the outcome of the treatment, 
the empathy displayed by the physician and health 
care team during their interactions, and each patient's 
individual world view. Therefore, the consulting mod- 
el of our MDT practice is patient-centric. This means 
that all decisions consider the relationship between 
the patient, his or her family, and our health care 
team. We provide face-to-face counseling for every 
patient, and guarantee no less than a half-hour con- 
sultation for deciding on a personalized therapeutic 
regimen and answering patient questions. 


3. Guideline-first decision making 

A core principle of our MDT decision-making is 
“guideline-first,” which means that every decision 
is evidence-based, and that each patient is treated 
according to clinically accepted and approved guide- 
lines. The goal is to provide appropriate therapy for 
cancer patients to prolong their overall survival. 


4. A personalized therapeutic regimen 

Personalizing cancer care implies using MDTs for 
clinical decision-making, since personalized care 
requires input from a range of different scientific and 
care domains. In our teams, clinical information and 
genomic profiling results are reviewed by profession- 
als to identify clinically significant results. Patients for 
whom traditional therapies such as surgery, chemo- 
therapy, or radiotherapy have failed, or who have 
elected to give up such treatments to try targeted 
therapy, are matched with a targeted therapy or 
immunotherapy regimen if one is available. These 
treatments might be administered through a clinical 
trial or using a drug approved by the China Food 
and Drug Administration or the U.S. Food and Drug 
Administration. The team also creates personalized 
therapeutic regimens using cancer immunotherapy 
or combinational therapy based on individual clinical 
characteristics, genomic aberrations, and the specific 
microenvironment of each tumor. 


5. Public welfare 

The operation of our MDTs is supported by the 
Chinese Precision Medicine and Immunotherapy for 
Clinical Oncology Fund, which is a public welfare 
organization affiliated with the China Social Welfare 
Foundation. We primarily provide free services to 
hepatobiliary cancer patients, including multidisci- 
plinary consultation, tumor genomic sequencing, and 
protein expression analysis to identify immune system 
biomarkers. Our mission is firstly to raise awareness 
both of current achievements using targeted therapies 
and immunotherapy, and of the potential and 
limitations of these therapies; and secondly to guide 
patients in seeking out clinical trials where their tu- 
mors can be better profiled and they can gain access 
to novel treatments. We also plan to develop more 
clinical trials to implement personalized therapeutic 
regimens. 

Despite some obvious challenges, the largely en- 
couraging clinical results from targeted therapeutics 
and biomarker-guided clinical trials are fueling fur- 
ther technological advancements in next-generation 
sequencing technology, data interpretation, and the 
associated preclinical and clinical cycles of drug de- 
velopment. All of this translates into a major shift in 
clinical practice. Moreover, cancer immunotherapy, 
especially using single-antibody immune checkpoint 
inhibitors, has shown promising efficacy against 
many solid cancers, and significantly improves the 
response rate against several solid cancers when 
used in combination with targeted drugs. These ad- 
vances will improve the survival and quality of life of 
many cancer patients in the near future. 
Current status of 
immunotherapy in 
advanced HCC 


Shukui Qin* 


Primary liver cancer, consisting mainly of 
hepatocellular carcinoma (HCC), is one of the most 
common cancers worldwide, with especially high 
prevalence in China and a growing incidence: 782,000 
new cases were reported worldwide in 2012 (1, 
2). As one of the leading causes of cancer-related 
deaths, HCC was responsible for an estimated 
746,000 deaths worldwide in 2012 (2). China alone 
accounted for more than 50% of both new HCC 
cases and HCC-related deaths globally (3). Chronic 
infection due to hepatitis B virus (HBV) or hepatitis 
C virus (HCV) contributes to an estimated 75% of all 
HCC cases (4). In the Asia-Pacific region, more than 
75% of cases are associated with HBV infection (5). 

Surgical resection or orthotopic liver transplant 
(OLT) offers the best chance for successful treatment 
of HCC (6). However, surgical resection, liver 
transplantation, and radiofrequency ablation (RFA) are 
only applicable to a small portion of patients with 
well-preserved liver function who have early-stage or 
localized HCC (7). Due to difficulties in early diag- 
nosis, most HCC cases are locally advanced or show 
distant metastases at the time of diagnosis. These 
patients generally have a poor prognosis, with me- 
dian survival of 6 to 20 months (8), and a five-year 
survival rate of less than 16% (9). 

For locally advanced or metastatic HCC, systemic 
therapy is often used as an important palliative 
treatment. Various conventional systemic chemotherapy 
regimens such as doxorubicin have been used clinically, 
although there are few well-controlled studies 
demonstrating the efficacy of systemic chemotherapy 
in the management of HCC (10). In 2012, our 
group was the first to demonstrate that a regimen of 
"FOLFOX4" (oxaliplatin, 5-fluorouracil, and leucov- 
orin) significantly improved the objective response 
rate (ORR) and prolonged overall survival (OS) 
compared with doxorubicin alone in a randomized 
phase 3 clinical trial in Chinese HCC patients (EACH 
study) (11). Based on this trial, the FOLFOX4 regimen 
was approved by the Chinese Food and Drug 
Administration (CFDA) for use in first-line treatment 
of advanced or metastatic HCC patients in China. 

Besides chemotherapy, molecularly targeted 
therapies have also been evaluated in clinical 
practice globally. In 2007, sorafenib, a tyrosine 
kinase inhibitor against multiple targets including 
RAF, vascular endothelial growth factor receptor, 
and platelet-derived growth factor receptor, was 
shown to extend median overall survival in patients 
with advanced HCC, and was approved by the 
U.S. Food and Drug Administration (FDA) and the 
European Medicines Agency (EMA) as a first-line 
treatment for advanced HCC. However, its use in 
the clinic has been limited due to low ORR, limited 
survival benefit, toxicity, and cost. After sorafenib, 
a series of molecular-targeted drugs including 
sunitinib, brivanib, lapatinib, linifanib, everolimus, 
axitinib, pazopanib, tivantinib, and ramucirumab 
were studied. Unfortunately, all failed in phase 3 
trials. Additionally, trials combining either sorafenib 
with erlotinib or sorafenib with transarterial 
chemoembolization were also unsuccessful. In 
the decade since the approval of sorafenib, only 
two phase 3 studies, both involving multikinase 
inhibitors, generated positive results: regorafenib 
versus placebo as a second-line treatment 
(RESORCE study, ClinicalTrials.gov: NCT01774344) 
(12) and lenvatinib versus sorafenib as a first-line 
treatment (REFLECT study, ClinicalTrials.gov: 
NCT01761266) (13). Most recently, the U.S. FDA 
approved regorafenib for advanced HCC patients 
who have been previously treated with sorafenib. 
Lenvatinib is currently under regulatory review by 
FDA and EMA for first-line treatment of advanced 
HCC patients. 

Although a growing number of 
molecularly targeted therapies and a few cytotoxic 
drugs are under rapid development, clinical outcomes 
have been unsatisfactory to date. There is a great 
unmet medical need for novel therapies with better 
response rates and survival. 


Immunotherapies for HCC 

Tumor immunotherapies, especially immune 
checkpoint inhibitors, have recently been estab- 
lished as effective treatments for several types of 
cancers. Immune checkpoint inhibitors release the 
“brakes” on the immune system and restore the 
ability of immune cells to eliminate cancer cells. 
Several immune checkpoint targets such as pro- 
grammed cell death protein 1 (PD-1), programmed 
death ligand 1 (PD-L1), and cytotoxic T-lymphocyte 
antigen 4 (CTLA-4) are being investigated extensively 
in various types of cancers. In the past few 
years, many clinical studies have shown immuno- 
therapy to be a promising treatment option for 
many solid tumors and hematologic malignancies. 
Some of these agents are now approved for the 
treatment of melanoma, non-small cell lung cancer, 
renal cell carcinoma, head and neck cancer, bladder 
cancer, liver cancer, gastric cancer, mismatch repair 
(MMR)-deficient cancer, Merkel cell carcinoma, and 
Hodgkin's lymphoma. 

The human liver has a unique immunobiology 
whereby multiple regulatory mechanisms are 
in place to maintain an immunosuppressive 
environment. Normal liver tissue is inherently 
tolerogenic to environmental autoantigens and 
toxins in order to prevent aberrant immunity 
to pathogens encountered through arterial 
circulation and from the gut (14, 15). Clinical and 
nonclinical data showed increased numbers 
of immunosuppressive cells in HCC, including 
regulatory T cells and myeloid-derived suppressor 
cells, as well as increased expression of inhibitory 
signaling molecules such as CTLA-4 and PD-1 
(16-18). HBV and HCV infections have also been 
associated with the increased proliferation of 
regulatory T cells and upregulation of PD-L1/ 
PD-1 expression, thereby suggesting a role 
for this pathway in HBV- or HCV-mediated 
hepatocarcinogenesis (15, 18-21). Overexpression 
of PD-L1 or PD-L2 has also been shown to be 
associated with tumor aggressiveness, disease 
progression, and high mortality in HCC patients 
(17, 22). Therefore, therapeutic agents that 
target immune checkpoints may potentially 
improve clinical outcomes by reversing the 
immunosuppressive nature of the HCC tumors and 
stimulating host immunity against the tumor cells. 
Interestingly, encouraging antitumor activity has 
now been reported in several clinical studies with 
anti-CTLA-4, anti-PD-1, and anti-PD-L1 antibodies. 

Tremelimumab is a fully human immunoglobulin 
G2 (IgG2) monoclonal antibody that 
blocks CTLA-4. In a phase 2 study reported in 
2013, of 21 HCV-positive patients, 17 were assess- 
able for tumor response (23). The tumors shrank 
or disappeared in three patients (17.6%), and the 
disease control rate was 76.4%, with a median time 
to progression of 6.48 months. This study provided 
the first evidence of the antitumor activities of im- 
mune checkpoint inhibitors in HCC, and greatly en- 
couraged further investigations and clinical trials. 

Several anti-PD-1 antibodies are currently being 
evaluated in HCC patients globally. In September 
2017, the U.S. FDA approved nivolumab for the 
treatment of patients with HCC who have been 
previously treated with sorafenib. Accelerated 
approval for this indication was granted based 
on the tumor response rate and durability of 
response observed in the CheckMate-040 trial 
(24). In this trial, 14.3% [95% confidence interval 
(CI): 9.2-20.8; 22/154] of patients responded 
to treatment with nivolumab. The percentage 
of patients with a complete response was 
1.9% (3/154), and the percentage with a partial 
response was 12.3% (19/154). Among responders 
(n=22), duration of response ranged from 3.2 
to 38.2+ months; 91% of those patients had 
responses lasting six months or longer and 55% 
had responses lasting 12 months or longer. The 
median time to response was 2.8 months (range: 
1.2-7.0). The overall response rate based on 
modified RECIST criteria was 18.2% (95% CI: 
12.4-25.2; 28/154). The complete response rate was 
3.2% (5/154) and the partial response rate was 14.9% 
(23/154), also based on modified RECIST criteria. A 
phase 3 randomized trial (CheckMate-459) is now 
ongoing to assess the clinical activity of nivolumab 
versus sorafenib in first-line HCC treatment 
(ClinicalTrials.gov: NCT02576509) (25). Another 
anti-PD-1 antibody, pembrolizumab, is also under 
investigation for treatment of HCC patients. The 
KEYNOTE-240 study is a phase 3 study to assess 
pembrolizumab versus placebo plus best supportive 
care as a potential second-line therapy in patients 
with previously systemically treated advanced 
HCC (ClinicalTrials.gov: NCT02702401) (26). This 
study is currently ongoing. 
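
The response rates and confidence intervals quoted above for CheckMate-040 can be checked against the reported counts. The following is a minimal sketch, not the trial's statistical code: it assumes the intervals are exact (Clopper-Pearson) binomial intervals, which the text does not state, and all function names are illustrative.

```python
from math import comb


def binom_cdf(k, n, p):
    """P(X <= k) for X ~ Binomial(n, p)."""
    return sum(comb(n, i) * p**i * (1 - p) ** (n - i) for i in range(k + 1))


def _bisect(f, target):
    """Solve f(p) = target on [0, 1] for a function f that decreases in p."""
    lo, hi = 0.0, 1.0
    for _ in range(60):
        mid = (lo + hi) / 2
        if f(mid) > target:
            lo = mid  # f is still too large, so the root lies at a larger p
        else:
            hi = mid
    return (lo + hi) / 2


def clopper_pearson(k, n, alpha=0.05):
    """Exact two-sided (1 - alpha) interval for k responders out of n.

    Assumption: the article does not say which interval method was used.
    """
    lower = 0.0 if k == 0 else _bisect(lambda p: binom_cdf(k - 1, n, p), 1 - alpha / 2)
    upper = 1.0 if k == n else _bisect(lambda p: binom_cdf(k, n, p), alpha / 2)
    return lower, upper


# Counts as reported in the text (responders out of 154 patients).
reported = {
    "overall response": 22,
    "complete response": 3,
    "partial response": 19,
    "overall response (modified RECIST)": 28,
}

for label, k in reported.items():
    low, high = clopper_pearson(k, 154)
    print(f"{label}: {k}/154 = {100 * k / 154:.1f}% "
          f"(95% CI {100 * low:.1f}-{100 * high:.1f})")
```

The point estimates follow directly from the counts (for example, 22/154 is 14.3%); the interval bounds depend on the assumed method and may differ slightly from the published values if another method was used.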

In addition to anti-PD-1 antibodies, anti-PD-L1 
antibodies are also being evaluated in HCC pa- 
tients. A phase 1/2 clinical trial of durvalumab in 
predominantly sorafenib-pretreated HCC patients 
showed an ORR of 10%, with a median OS of 13.2 
months and a well-tolerated safety profile (Clin- 
icalTrials.gov: NCT01693562) (27). A separate 
phase 1/2 study of durvalumab in combination 
with tremelimumab in unresectable HCC showed 
an ORR of 25%, with no unexpected safety signals 
(ClinicalTrials.gov: NCT02519348) (28). Overall, the 
trial demonstrated that a regimen of durvalumab 
plus tremelimumab was well-tolerated in this unre- 
sectable HCC patient population. 

Many combinations of immunotherapies with 
molecularly targeted therapies are being evalu- 
ated in advanced HCC patients. Examples in- 
clude pembrolizumab/lenvatinib (ClinicalTrials. 
gov: NCT03006926) (29), nivolumab/galunisertib 
(ClinicalTrials.gov: NCT02423343) (30), nivolumab/ 
yttrium Y 90 glass microspheres (ClinicalTrials.gov: 
NCT02837029) (31), and nivolumab/cabozantinib 
(ClinicalTrials.gov: NCT01658878) (32). 

In addition to checkpoint inhibitors, chimeric 
antigen receptor T cell (CAR-T) therapy has also 
made significant progress in the field of cancer 
immunotherapy. Remarkable clinical outcomes 
have been shown with CAR-T treatment, especially 
CD19-directed CAR-T in various hematologic 
malignancies (33). However, to date, the clinical 
activity of adoptive CAR-T treatment in solid tumors 
has been less impressive. In HCC, CAR-T therapies 
targeting various antigens, including CEA, MUC1, 
GPC3, EGFR, EpCAM, and CD133, are under 
investigation in early stage clinical trials (34). 


Immunotherapies for HCC in China 

HCC remains a cancer with a high mortality rate 
and a clear unmet medical need in China, despite 
the fact that multiple HCC prevention programs 
supported by the Chinese government have been 
in place for many years (35, 36). Based on data 
retrieved from the China National Central Can- 
cer Registry, estimated new cases of liver cancer 
numbered about 356,000 in China in 2011, and the 
incidence rate was 26.39 per 100,000. There was 
also an increasing trend in the incidence rate of 
liver cancer in China from 2000 to 2011. As a result, 
the burden of liver cancer is still very high in China 
(37). Development of effective therapies for HCC 
patients remains a big challenge and an important 
unmet medical need. 

With the boom in the Chinese biotech industry 
over the past five years, several China-based 
biopharmaceutical companies have been actively 
developing innovative immuno-oncology drugs 
for the treatment of cancers, including HCC. 
BGB-A317, developed by BeiGene, is an anti-PD-1 
antibody engineered to minimize Fc-gamma 
receptor binding on macrophages, with the aim 
of abrogating antibody-dependent phagocytosis, 
a potential mechanism of T-cell clearance. Results 
from its ongoing phase 1 study were recently 
reported (38). At the time of data cutoff, 40 
patients with unresectable HCC had been enrolled. 
A majority (85%) of these patients were infected 
with HBV. The median treatment duration was 64 
days, with a range of 1 to 471 days. BGB-A317 was 
well tolerated. Twenty evaluable patients remained 
on the treatment at the data cutoff date; partial 
responses were observed in three patients (all 
HBV-positive) and nine patients achieved stable disease, 
some of whom also had significant reductions in 
α-fetoprotein (AFP) levels, which indicates a positive 
treatment response. Based on these initial results, a 
randomized, open-label, multicenter phase 3 study 
has been initiated to assess the efficacy and safety 
of BGB-A317 versus sorafenib in first-line patients 
with unresectable HCC. 

In addition, SHR-1210, a humanized anti-PD-1 
IgG4 antibody developed by Jiangsu Hengrui 
Pharmaceutical Co., is under investigation either 
as a single agent or in combination with apatinib 
or FOLFOX4 in HCC. There are also at least 15 
other anti-PD-1 and anti-PD-L1 antibodies from 
different pharmaceutical companies that have 
either received authorization from CFDA or are 
currently under review to initiate clinical studies in 
China (39). 

Glypican 3 (GPC3), a member of the glypican 
family of heparan sulfate proteoglycans, is high- 
ly expressed on the surface of liver cancer cells 
with minimal expression on normal tissues (40). A 
clinical trial was initiated by Chinese investigators 
from Renji Hospital in collaboration with CARsgen 
Therapeutics to investigate GPC3-targeted CAR-T 
in Chinese patients with GPC3-positive refractory 
or relapsed HCC. Its preliminary phase 1 results 
showed that GPC3-targeted CAR-T was well tol- 
erated in 13 Chinese patients with GPC3-positive 
HCC (41). Of these 13 patients, eight were treat- 
ed with lymphodepleting conditioning (LDC) at 
baseline. Among these eight patients, one had a 
partial response, three showed stable disease, 
two had progressive disease, and two were 
unevaluable. Taken together, GPC3-targeted CAR-T 
therapy showed encouraging preliminary clinical 
activity in Chinese patients with advanced HCC. 


Conclusions 

HCC remains one of the most devastating malig- 
nancies in the world, and a disease with significant 
unmet medical need, particularly in China. Immu- 
notherapies, especially immune checkpoint inhibi- 
tors, have demonstrated preliminary but promising 
clinical activity in patients with HCC, and several 
large randomized phase 3 clinical trials of these 
treatments are currently underway. Despite the 
limited progress and challenges in the development 
of HCC treatment in the past decades, there is hope 
on the horizon. With the joint efforts of pharmaceu- 
tical companies and academic institutions, we look 
forward to the continuous development of safer, 
more effective treatments for patients with advanced 
or metastatic HCC. 
Challenges and prospects 
for precision cancer 
immunotherapy in China 


Zhihao Lu, Jianling Zou, 
Shuang Li, and Lin Shen* 


With increasing incidence and mortality, 
cancer has become a major public health problem 
in China (1). Surgery, radiotherapy, chemotherapy, 
and targeted therapy remain standards of care 
for cancer patients. However, the effectiveness of 
these treatments is not completely satisfactory. 
The success of immunotherapy, especially anti-PD-1/PD-L1 
treatment, which blocks the ability of 
tumor cells to shield themselves from attack by 
the immune system, has led to some important 
strides in cancer care in recent years. At the same 
time, immunotherapy has become a hot area in 
cancer research and treatment in China. Here, we 
discuss the challenges and prospects for precision 
immunotherapy in the treatment of cancer, based on 
the unique characteristics of the Chinese population. 


Current status of cancer immunotherapy 
Chinese pharmaceutical companies and research- 
ers are highly motivated to explore new immuno- 
therapy agents and therapies. Even though no PD-1 
or PD-L1 inhibitor has yet been approved by the 
Chinese Food and Drug Administration (CFDA), the 
Chinese government is making earnest efforts to im- 
prove the overall clinical research environment, and 
the CFDA is actively reforming the regulatory frame- 
work to accelerate approval of novel agents (2). By 
July 2017, Chinese pharmaceutical companies had 
developed 10 anti-PD-1/PD-L1 inhibitors, eight of 
which have been approved by CFDA for phase 1 
to phase 3 clinical trials in patients with advanced 
solid tumors (Table 1). In November 2016, clinical 
trials of PD-1 inhibitor KN035 were approved by the 
U.S. Food and Drug Administration (FDA) (3). In the 
past five years, around 1,000 international clinical 
trials of PD-1/PD-L1 therapies for solid tumors have 
been registered in the ClinicalTrials.gov database, of 
which 100 involved Chinese sites (2). Thus, Chinese 
clinical trials are developing rapidly and catching up 
with those in other parts of the world. 

Agent | Target | Company | Phase | Indication | ClinicalTrials.gov identifier 
JS001 | PD-1 | Shanghai Junshi Biosciences | Phase 1, 2 | Melanoma, urological cancer | NCT02836795, NCT03013101 
SHR-1210 | PD-1 | Jiangsu Hengrui Medicine | [illegible] | Breast cancer, gastrointestinal cancer, [illegible] | NCT02742935, [illegible] 
BGB-A317 | PD-1 | BeiGene | Phase 1, 2 | Various malignant solid tumors | NCT02407990, NCT03209973 
Genolimzumab | PD-1 | Genor BioPharma | Phase 1 | Various malignant solid tumors | NCT03053466 
IBI308 | PD-1 | Innovent Biologics | Phase 1, 2 | Various malignant solid tumors | NCT02937116, NCT03116152 
Recombinant humanized anti-PD-1 monoclonal antibody | PD-1 | [illegible] Pharmaceuticals | Phase 1 | Various malignant solid tumors | NA 
Recombinant humanized anti-PD-1 monoclonal antibody | PD-1 | Taizhou Hanzhong | IND | Various malignant solid tumors | NA 
Recombinant humanized anti-PD-1 monoclonal antibody | PD-1 | Bio-Thera Solutions | IND | Various malignant solid tumors | NA 
Recombinant humanized PD-L1 single-domain antibody Fc fusion protein | PD-L1 | 3D Medicines/Alphamab | Phase 1 | Various malignant solid tumors | [illegible] 
Recombinant fully human anti-PD-L1 monoclonal antibody | PD-L1 | CStone Pharmaceuticals | Phase 1 | Various malignant solid tumors | NA 

TABLE 1. PD-1/PD-L1 blockades in clinical trials in China. NA, none available; IND, investigational new drug. 

However, there are some problems with the 
current situation. The promise of these drugs is 
leading to a crowded market. Numerous compa- 
nies are jockeying for position, while many others 
are hoping for profitable out-licensing deals. All of 
this makes quality control a serious challenge. 
Furthermore, whether in pharmaceutical development 
or regimen design, China has been producing too 
many “me-too” drugs—innovation has been lacking. 
Finally, because of overreaching pragmatism and 
the current system for evaluating research (which 
places great importance on publication numbers), 
it is possible that applied research could impede 
the progress of pure science. 


Challenges facing cancer immunotherapy 
Anti-PD-1/PD-L1 treatment is one of the most 
successful immunotherapies against solid tumors, 
but it faces many difficulties in China. Several of 
these relate to biomarkers: 


Lack of predictive biomarkers for 
treatment response 

Multiple tumor types have been shown in early 
phase trials to have a long-lasting response to anti- 
PD-1/PD-L1 treatment (4, 5). However, the overall 
response rate to this treatment is only 10%-44% 
(6, 7). Therefore, better biomarkers are needed 
to guide patient selection and provide early 
indicators of treatment response. 


Lack of predictive biomarkers for 
adverse events 

Adverse effects, including lethal complications, 
have been reported during anti-PD-1/PD-L1 treat- 
ment (8). Most clinical trials focus on potential pre- 
dictive biomarkers of treatment response, but no 
reliable biomarkers for adverse events have been 
identified. Such biomarkers are equally necessary in 
the clinic. 


Lack of predictive biomarkers for 
hyperprogressive disease 

Champiat and colleagues reported for the first 
time instances in which a cancer progressed 
unusually rapidly in patients receiving immune 
checkpoint inhibitors like anti-PD-1/PD-L1, 
described as “hyperprogressive disease.” The 
incidence varied from 9% to 29% (9, 10) of patients 
receiving the immune checkpoint therapy. As 
this study indicates, it is necessary to identify 
biomarkers for hyperprogression in order to 
selectively avoid those patients who might be 
harmed by immune checkpoint inhibitors. As of 
now, unfortunately no such explicit biomarkers 
have been found. 

In addition to these difficulties, Chinese patients 
have some special characteristics that need to be 
considered in designing immune therapies for 
them. They have relatively high rates of hepatitis B 
virus infection, for example, and more frequently 
use Chinese traditional medicine. Chinese patients 
also have more “driver gene” mutations that are 
linked to cancer development, better clinical 
responses to chemotherapy, and different toxicity 
profiles (11-14), which may have an effect on anti- 
PD-1/PD-L1 therapy. These factors have rarely 
been considered in trial designs. Furthermore, 
Chinese patients have limited access to immune 
checkpoint inhibitors such as antibodies against 
PD-1 and PD-L1, so there has been only a small 
patient pool in which to explore biomarkers in the 
Chinese population. 


Toward precision cancer immunotherapy 
in China 

Because of the dynamic nature of the immune 
system and the multiple elements involved in 
the complex immune response to cancer, it is 
much more challenging to develop biomarkers 
for immunotherapy than for other treatments. 
Thus, it is imperative to establish a paradigm for 
precision immunology to steer patients through 
immunotherapy. Some strategies for precision 
cancer immunotherapy include the following: 


1. Mechanism-based predictive biomarkers for 
patient selection 


Mismatch repair deficiency 

Deficiency in DNA mismatch repair (MMR) is a 
specific genetic aberration that results in a manyfold 
increase in tumor mutational burden in 
various tumor types and has been explored as a predictor of 
responsiveness to anti-PD-1 treatment (15). The high level of 
microsatellite instability (MSI-H)—a predisposition 
to mutation that is characteristic of MMR-deficient 
(dMMR) cancers—or the absence of one of the MMR 
proteins, is used to identify this genetic subset of 
patients. 

In 2017, the U.S. FDA granted accelerated 
approval to pembrolizumab for the treatment of 
adult and pediatric patients with unresectable or 
metastatic solid tumors that have been identified 
as MSI-H or dMMR. The approval covers patients 
whose disease has progressed after prior 
treatment and who have no satisfactory alternative 
treatment options. However, only about 4%-5% 
of patients with advanced solid tumors have 
been identified as MSI-H or dMMR, and their 
therapeutic response rate is approximately 40% (16). 
Thus, further exploration of novel biomarkers is 
needed to refine anti-PD-1 treatment strategies. 
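
The two percentages above can be combined to estimate how small a share of all patients this biomarker identifies as likely responders. The sketch below is a back-of-the-envelope calculation, assuming that the roughly 40% response rate applies uniformly to the 4%-5% of patients who are MSI-H or dMMR; the function name and the cohort size of 1,000 are illustrative only.

```python
def expected_responders(prevalence, response_rate, cohort=1000):
    """Expected number of responders found via a biomarker in a cohort.

    prevalence: fraction of patients who are biomarker-positive
    response_rate: fraction of biomarker-positive patients who respond
    cohort: hypothetical number of patients screened (illustrative)
    """
    return cohort * prevalence * response_rate


# Figures from the text: 4%-5% MSI-H or dMMR, about 40% response.
for prevalence in (0.04, 0.05):
    n = expected_responders(prevalence, 0.40)
    print(f"prevalence {prevalence:.0%}: about {n:.0f} responders per 1,000 patients")
```

Under these assumptions, roughly 16 to 20 of every 1,000 patients with advanced solid tumors (about 2%) would both carry the biomarker and respond to treatment.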


PD-L1 expression 

The protein PD-L1, detected on patient tumor 
cells using immunohistochemistry (IHC), is 
currently the most common clinically detected 
biomarker for predicting patient response to anti- 
PD-1/PD-L1 therapy (17). 

A recently published randomized phase 3 trial 
in advanced non-small cell lung cancer (NSCLC) 
revealed a correlation between high PD-L1 
expression and longer progression-free survival 
and overall survival in immunotherapy treatment 
regimens (18). Nevertheless, a recent review 
of immune biomarker utility in NSCLC therapy 
indicated a lack of consensus on how to use PD-L1 
expression as a biomarker to help in selecting 
treatment regimens. In fact, patients with little or 
no detectable expression of this biomarker can 
also benefit from anti-PD-L1 therapy (19). A similar 
scenario has been observed in other tumors 
such as gastric cancer (20), renal cell carcinoma 
(21), and urothelial cancer (22). However, 
due to the complexities of PD-L1 IHC, further 
studies are needed to carefully validate these 
observations and other characteristics of the tumor 
microenvironment. 


Mutational landscape and mutational burden 

Various studies have shown that the number of 
mutations in a tumor’s genome (the mutational 
burden) correlates with greater efficacy of diverse 
anti-PD-1/PD-L1 drugs (23, 24). Analyzing the 
genetic signatures of tumors might therefore 
identify patients who have a better chance of 
responding to checkpoint inhibition (25). NSCLC 
patients with a high nonsynonymous mutational 
burden showed long-lasting clinical benefits from 
treatment with pembrolizumab. Furthermore, 
another study showed that almost all patients with 
a low nonsynonymous mutational burden did not 
benefit from the same treatment (24). These results 
make the mutational landscape of the tumor one 
of the most promising predictive factors both at 
the beginning of treatment and after an initial 
favorable response. 


Other predictive biomarkers 

It is well known that there is a relationship 
between inflammatory cells, cancer, and 
proinflammatory proteins such as chemokines. 
Some findings demonstrate that the genetic 
signatures of genes related to the cytokine 
interferon γ appear to be reasonable tumor 
biomarkers that could soon be incorporated into 
predictive models of how patients will respond to 
checkpoint inhibitors (26, 27). 

Recent studies have also shed light on the 
previously unknown effect of the gut microbiota 
on immunotherapy for patients being treated with 
checkpoint blockade inhibitors (28, 29). Future 
studies of the gut microbiota of cancer patients 
who respond to checkpoint inhibitors could 
provide invaluable information to aid in patient 
selection. 


Findings from 2017 show that CKLF-like MARVEL 
transmembrane (CMTM) domain-containing 
proteins CMTM6 and CMTM4 are associated 
with the PD-L1 protein and enhance the ability of 
tumor cells that produce PD-L1 to inhibit T cells, 
suggesting a potential therapeutic target in the 
effort to overcome immune evasion by tumor cells 
(30, 31). 

Results of completed trials indicate that 
numerous candidate biomarkers of toxicity to 
antibodies against the immune checkpoint 
cytotoxic T-lymphocyte antigen 4 (CTLA-4) 
should be considered for further evaluation 
in larger prospective trials. 
Of these promising biomarkers, neutrophil 
activation markers and several immunoglobulin 
genes warrant further investigation as biomarkers 
predictive of toxicity to anti-CTLA-4 treatment. 
There is currently no predictive biomarker for 
toxicity to anti-PD-1 treatment. Evaluating the 
gene expression profile of patients who develop 
toxicities to anti-PD-1/PD-L1 treatment may also be 
an interesting landscape to explore in the future. 


2. Predictive biomarkers related 
to hyperprogression 

Champiat and colleagues at Université 
Paris Saclay found a greater incidence of 
hyperprogression in patients 65 years old or 
older than in those younger than 65 (9). That may 
be because the functioning of immune cells, 
chemotaxis, phagocytosis, and intracellular killing 
of pathogens is decreased in older patients (32), 
but the mechanism by which this happens is not 
clear. Saada-Bouzid and colleagues (10) found that 
hyperprogression correlated more significantly 
with the presence of a recurrence near the original 
tumor than with a recurrence farther away (90% 
versus 37%) in patients with recurrent or metastatic 
head and neck squamous cell cancer. Kato and 
colleagues (33) investigated potential genomic 
biomarkers associated with hyperprogression after 
immunotherapy. Their results showed that murine 
double minute 2 (MDM2) family amplification 
or epidermal growth factor receptor (EGFR) 
alterations were correlated with hyperprogression 
in patients previously treated with PD-1/PD-L1 
inhibitors, but not CTLA-4 inhibitors. However, the exact 
mechanism linking MDM2 family amplification 
and hyperprogression is unclear. Pretreatment 
and early postimmunotherapy biopsies would be 
valuable in exploring the underlying mechanism 
in the future. 
FIGURE 1. Opportunities for precision cancer immunotherapy and combination therapy strategies. Multiple approaches that target 
different pathways are required for successful anticancer treatment, including T-cell activation, blockade of immune inhibition, and 
correct recognition of tumor antigens. The combination immunotherapies represented here offer the possibility for achieving improved 
efficiencies in cancer treatment. TAM, tumor-associated macrophage; MDSC, myeloid-derived suppressive cell; Treg, regulatory T cell; 
APC, antigen-presenting cell; GITR, glucocorticoid-induced tumor necrosis factor-related protein; CTLA-4, cytotoxic T-lymphocyte 
antigen 4; TCR, T-cell receptor; MHC, major histocompatibility complex; IDO, indoleamine 2,3-dioxygenase; TAAs, tumor-associated 
antigens. 


3. Combination immunotherapy 

In recent years, there has been an increase 
in the number of novel combination therapies 
used in cancer immunotherapy treatments. The 
goal of combination approaches is to expand 
the spectrum of patients who would benefit from 
immunotherapy, to enhance its curative effects, and 
to reduce the number of side effects compared with 
single therapy regimens (Figure 1). 

Several hundred clinical trials are currently 
exploring the effects of anti-PD-1/PD-L1 used 
in combination with experimental immune 
modulators, such as other monoclonal antibody 
checkpoint inhibitors, cancer vaccines, epigenetic 
drugs, and drugs targeting metabolic pathways, 
and also with surgery, radiotherapy, and 
chemotherapy. Preliminary evidence, for example 
from combined PD-1 and CTLA-4 blockade in the 
treatment of melanoma (34) and NSCLC (35), has 
highlighted the potential to enhance the clinical 
benefits of monotherapies by combining them with 
agents that have synergistic mechanisms of action. 
Initial analyses from the international, multicohort, 
phase 2 KEYNOTE-059 study showed that the objective 
response rate (which includes both partial and complete 
responses) was 60% across all patients (36). 
Despite the promise of combination therapies, 
several issues need to be further investigated: 

1. Combination agents. Although there are 
dozens of clinical trials of immune modulators 
used in combination with anti-PD-1/PD-L1 
treatment, there is little research in China on the 
rationale, dosages, schedules of treatment, and 
configuration of the specific combinations used. 

2. Predictive biomarkers for combination 
immunotherapy. Although studies have found the 
rate of response to combination therapies to be 
much higher than that of monotherapies, there is 
still a proportion of patients who do not survive. 
Finding predictive biomarkers for combination 
immunotherapy is therefore of prime importance, 
and will probably require dynamic monitoring of 
the immune response and a fuller understanding 
of the changes in tumor microenvironments 
before, during, and after treatment. 

3. Predictive biomarkers for adverse events. 
Combination immunotherapy usually brings more 
instances of adverse events. In the KEYNOTE-059 
study, grade 3-4 treatment-related adverse 
events (TRAEs), including neutropenia, stomatitis, 
anemia, thrombocytopenia, anorexia, and 
fatigue, occurred in 76% of patients, a much 
higher frequency than with chemotherapy or 
targeted therapies. We should therefore pay more 
attention to TRAEs associated with combination 
therapy, and attempt to identify novel predictive 
biomarkers for them. 


Conclusions 

The ultimate goal of precision immunology 
for cancer is to select a subset of patients with a 
common biological basis of cancer: the patients 
who are most likely to benefit from a particular 
immunotherapy. Hopefully, with the development 
of new technologies to dynamically monitor 
the cancer-immune system interaction and by 
considering the special features of different 
populations, precision cancer immunology will 
be applied more widely, improving diagnosis, 
stratification, and treatment of cancer patients in 
China. 


The rise of engineered 
T-cell therapy in China 


Jianshu Wei, Yi Zhang, and Weidong Han 


There are many treatment methods for controlling 
or eliminating cancer cells, including tumor 
vaccines, chemotherapy, radiation therapy, and 
tyrosine kinase inhibitors. The advancement of basic 
research into the progression of malignancies has 
resulted in significant improvements in treatment. 
However, complete and durable remission for inop- 
erable malignancies remains rare. There is a tremen- 
dous effort underway to find novel therapies that can 
offer better prospects for destroying tumors. 

Recently, immunotherapy as a tumor treatment 
has come to the forefront because of the significant 
successes of therapies using immune checkpoint 
antibodies and chimeric antigen receptor modified 
T (CAR-T) cells. Simply speaking, immune checkpoint 
antibodies fight tumors by activating the naturally 
occurring lymphocytes in the body that can rec- 
ognize and kill the tumor cells. CAR-T cells are the 
patient's own cells educated in vitro by genetic 
manipulation to kill tumor cells. The immune system 
is so sophisticated that the mechanisms underlying 
cancer immunology are far from fully explained, al- 
though there has been steady progress over the past 
30 years. As clinical applications have developed, T 
cell-centered cancer immunotherapy has proven to 
be an effective antitumor treatment. 

In 2017, the U.S. Food and Drug Administration 
(FDA) approved the drug CTL019 (tisagenlecleucel-T, 
Novartis), the first CAR-T treatment to get the 
regulatory nod. It was a milestone in tumor 
immunotherapy, marking the start of the CAR-T era. With 
strong support from the government, CAR-T therapies 
are developing very rapidly in China, where the 
number of registered clinical trials recently 
surpassed that in the United States. 

Cancer immunotherapy has been in clinical use 
in China since before 1988, in the form of a method 
using lymphokine-activated killer (LAK) cells. LAK 
cells are white blood cells stimulated to grow in vitro 
and cultured in the presence of interleukin 2 (IL-2). 
In 1988, Rosenberg and colleagues first reported 
a method for preparing LAK cells (1). Since then, 
LAK cells have been used to treat multiple types of 
tumors (2). 

In 1991, Schmidt-Wolf reported on cytokine- 
induced killer (CIK) cells (3). CIK cells were more 
cytotoxic than LAK cells and exhibited better 
proliferation efficiency. Several clinical trials on CIK 
cells have been conducted since then. In China, the 
first research article on the subject was published 
in 2002 (4). Subsequently, CIK therapy has been 
widely utilized for a variety of reasons, including its 
safety, relative technical simplicity, and the support 
it received from the government. During this period, 
other treatments using natural killer (NK) cells and 
dendritic cell-activated CIK cells (DC-CIKs) were 
also in clinical use, providing a solid foundation for 
the development of newer CAR-T treatments. In 
general, the tumor-killing cells utilized during that 
time were natural lymphocytes—small white blood 
cells—with no modifications, and the killing effect was 
weak and depended largely on nonspecific tumor 
cell recognition mechanisms. As a result, the overall 
clinical benefits were minimal. 

To improve the clinical efficacy of these 
therapies, Chinese scientists have tried a variety 
of ways to increase the number of T lymphocytes 
that recognize a specific tumor, of which CAR-T 
technology is undoubtedly the most attractive. The 
generation of the first CAR molecule was reported 
in 1989 by Gideon Gross (5). This first-generation 
CAR-T showed poor activity in phase 1 clinical trials. 
The first domestic research using improved second- 
generation CAR-T was published in 2009, and 
involved a human epidermal growth factor receptor 
2 (HER2)-specific CAR that was designed for the 
treatment of breast cancer (6). The first article on 
CAR-T clinical trials in China was published in 2014 
by the Han laboratory of the People’s Liberation 
Army General Hospital (PLAGH) (7). With the 
government restricting the clinical use of CIK and 
similar minimally successful immune therapies, the 
use of natural lymphocytes gradually diminished, 
and China entered a new era of precision 
immunotherapies, exemplified by CAR-T. 

By early 2015, 12 clinical trials of CAR-T in China 
were registered on the ClinicalTrials.gov website, 
of which eight were conducted by PLAGH, two by 
Beijing Cancer Hospital, and one each by Renji 
Hospital and the 307 Hospital of the People’s 
Liberation Army. By January 2016, the total number 
of trials had risen to 26. As of September 2017, 121 
CAR-T trials have been registered by domestic 
hospitals, academic institutions, and private 
companies, and China has become the country 
with the most registered CAR-T trials worldwide. 

Of the 121 registered trials, 53 are CAR-T therapies 
targeting CD19 for treatment of leukemia, and 44 of 
these are still ongoing. CD19 is a membrane protein 
found on all B cells and critical for B-cell activation 
and proliferation. Generally speaking, acute lym- 
phocytic leukemia (ALL) is most responsive to this 
therapy, known as CART-19. Five of the nine patients 
enrolled in the first domestic CART-19 trial saw 
complete remission (CR) for two to nine months (8), 
and cytokine release syndrome (CRS) was observed 
in most cases. CRS occurs when large numbers of 
white blood cells release proinflammatory cytokines, 
which go on to activate more white blood cells, 
creating a potentially fatal cascade effect called a 
“cytokine storm.” Two patients who had previously 
received hematopoietic stem cell transplantation 
(HSCT) presented with grade 2-3 graft-versus-host 
disease (GVHD) after infusion of donor-derived 
CAR-T cells, suggesting that HSCT-treated patients 
without 100% chimerization (i.e., in which the hema- 
topoietic system was not completely reconstructed 
from donor-derived bone marrow cells) had a higher 
risk of GVHD when receiving donor-derived CAR-T 
cells. Overall, the rate of complete remission is 
a bit lower than that seen in results obtained at the 
Memorial Sloan Kettering Cancer Center (MSKCC), 
the University of Pennsylvania, and Novartis. It is 
worth noting that the patients recruited in this trial 
were more aggressively treated and in later stages 
of disease than the participants in the United States. 
Results from several subsequent domestic CART-19 
trials have shown that overall CR rates were com- 
parable to those in the United States. For example, 
the First Affiliated Hospital of Zhejiang University 
enrolled 15 patients with relapsed or refractory 
(R/R) ALL (9), 12 of whom had CR responses one 
month after CART-19 infusion. 

Relapse is the major obstacle to long-term survival 
of patients hoping for complete remission. Research- 
ers in China have been exploring ways to avoid 
relapse. The first option is to use bispecific CAR-T 
cells that are active against two different antigens. 
A group from the Southwest Hospital in Chongqing 
has registered a trial in which CART-19 and CART-20 
(targeting CD20) will be infused sequentially into 
patients with diffuse large B-cell lymphomas (DLBCL). 
The PLAGH group has carried out two trials 
in which tandem CART-19/20 and tandem CART-19/22 
were used for patients with R/R B-cell malignancies. 
Another common approach is to initiate 
HSCT treatment after obtaining a CR response using 
CAR-T therapy. A group at Lu Daopei Hospital in Beijing 
has treated 27 patients with this regimen (10). 
Of the 27 CR patients treated with allogeneic HSCT 
following CART-19, 23 remained disease-free for at 
least 45 days (median follow-up time was 206 days; 
range was 45-427 days). This suggested that further 
use of allogeneic HSCT could effectively reduce the 
risk of relapse. The Southwest Hospital group has 
also registered a trial in which HSCT will be given 
after CART-19 treatment for patients with B cell ma- 
lignancies. In collaboration with PLAGH, a team led 
by Huisheng Ai launched the world's first treatment 
to use coinfusion of haploidentical donor-derived 
CART-19 cells and mobilized peripheral blood stem 
cells (PBSCs; stem cells stimulated to move from the 
bone marrow into the bloodstream) for a 71-year-old 
female with R/R ALL (11). Any residual disease 
was undetectable following treatment, and CRS and 
GVHD were controlled. This result suggested that 
coinfusion of allogeneic CAR-T cells and PBSCs may 
induce full donor engraftment in R/R ALL, offering a 
novel option for combining CAR-T and HSCT. 

CD20 and CD22 have expression patterns sim- 
ilar to those of CD19, which are expressed in all 
B lymphocytes except B progenitor cells, making 
these proteins potential targets for managing B-cell 
malignancies. At present, there are five CD20 and six 
CD22 CAR-T trials registered in China. The first 
article on a CART-20 trial was published in 2014 by 
researchers at PLAGH (7); it was the first to report 
results of CART-20 therapy for DLBCL. Seven patients 
with DLBCL were treated with CART-20 cells, and five 
of the six evaluable patients showed an objective 
remission response (ORR). 

ALL and B-cell lymphoma are the cancers most 
sensitive to CAR-T therapy, so they are prime 
candidates for further research, both in China and 
abroad. But China’s research groups have also done 
pioneering work on other types of blood-based 
malignancies. 

Multiple myeloma (MM) is a fatal plasma cell 
cancer and the second most common hematolog- 
ical malignancy. Typically, CD19, CD20, and CD22 
are not detectable in MM. Clinical results have 
suggested that CD138, a transmembrane proteo- 
glycan found in certain hematopoietic cancers and 
carcinomas, may be an effective target for refractory 
and progressive MM. In 2015, researchers at PLAGH 
reported the first results of CART-138 therapy on 
MM. To date, five patients have been treated, and in 
four the disease was stabilized for more than three 
months, while one patient with progressive MM had 
a significant reduction in myeloma cells in peripheral 
blood, from 10.5% to less than 3% (12). This study 
suggested that CART-138 could be a promising 
approach for treating MM. There are three CART-138 
clinical trials presently registered in China. 

B-cell maturation antigen (BCMA), a protein 
reported to be expressed in B cells, including MM 
cells, is another potential target for MM treatment. 
At the 2017 conference of the American Society 
of Clinical Oncology (ASCO), Nanjing Legend 
Biotech Co., Ltd. presented data from their trial of 
BCMA-targeted CAR-T therapy (CART-BCMA) in 35 
participants with R/R MM. Thirty-three evaluable 
patients saw significant clinical responses (defined 
as a complete response or a very good partial 
response) within two months following CAR-T 
infusion. The overall ORR was as high as 100%. Four 
CART-BCMA clinical trials are currently registered 
in China. 

Acute myeloid leukemia (AML) is the most common 
acute leukemia affecting adults (13), and CD33 
antigen is expressed on leukemic cells in more than 
90% of patients with the disease (14). In 2015, a 
clinical trial was conducted at PLAGH to assess the 
feasibility and efficacy of CART-33 (CAR-T therapy 
targeting CD33) for the treatment of R/R AML. A 
marked decrease of myeloid blasts in the bone marrow 
two weeks after cell infusion was observed, but 
the disease progressed after nine weeks (15). This 
was the first demonstration of the potential of CART-33 
for the treatment of AML. Five clinical trials for 
treating AML with CART-33 are currently registered 
in China. 

In Hodgkin’s lymphoma (HL), the CD30 protein is 
a potentially potent target. This year, the first phase 
1 clinical trial results on CART-30 (CAR-T therapy 
directed at CD30) were reported by PLAGH researchers. 
Eighteen patients with progressive R/R HL were 
enrolled. The data showed that the treatment was 
well tolerated without severe toxicity. Seven patients 
achieved partial remission and the disease was stabilized 
in six patients (16). There are now four CART-30 
clinical trials underway in China. 

Compared with the performance of CAR-T seen 
with hematologic malignancies, there is general 
pessimism about the prospects of this therapy for 
treatment of solid tumors. Chinese researchers have 
nonetheless pursued this possibility with enthusiasm 
and have made many original contributions. Current- 
ly, 44 clinical trials of CAR-T cell treatments of solid 
tumors have been registered in China, covering 14 
different targets. 

Among them, the glypican-3 (GPC3) protein has 
attracted the most interest, with nine trials underway. 
GPC3 is generally overexpressed in multiple solid 
tumors, including lung cancer and hepatocellular 
carcinoma (HCC) (17, 18). A group led by Zonghai 
Li from Renji Hospital has done several preclinical 
studies to verify the feasibility of CART-GPC3 in the 
treatment of lung cancer and HCC (18, 19). 

Epidermal growth factor receptor (EGFR) is a gly- 
coprotein that is overexpressed in many malignan- 
cies that derive from the epithelium. These include 
non-small cell lung cancer (NSCLC) and multiple 
digestive tract cancers (20). A preclinical study of 
CART-EGFR was carried out in 2013 by Yuquan Wei's 
laboratory at Sichuan University, which provided an 
experimental basis for further clinical study (20). The 
first CART-EGFR clinical results were published by a 
group at PLAGH in 2016. Patients with EGFR-positive 
(>50% expression) R/R NSCLC received escalating 
doses of CART-EGFR. Among 11 evaluable patients, 
two achieved partial remission and five had a stabi- 
lized disease response for two to eight months. Sig- 
nificantly, eradication of EGFR-positive tumor cells 
was observed in tumor biopsies after the CAR-T infu- 
sion, providing the most direct evidence supporting 
the effectiveness of this treatment (21). However, 
this result also emphasized that the heterogeneity of 
solid tumors may present a formidable obstacle to 
the CAR-T treatment. Six CART-EGFR trials have been 
registered in China as of now. 

Several other domestic clinical trials of CAR-T treat- 
ment for solid tumors have been published. In 2016, 
a group headed by Lin Yang published a case report 
on CAR-T therapy targeting mucin 1 (MUC1) for a 
patient with MUC1-positive metastatic seminal ves- 
icle cancer in which significant tumor necrosis was 
observed (22). In 2017, a group at PLAGH published 
results from a phase 1 clinical trial using CAR-T cells 
to target HER2 in patients with advanced biliary tract 
cancers and pancreatic cancers (23). Cheng Qian’s 
group published results of a phase 1 trial of CAR-T 
cells targeting carcinoembryonic antigen (CEA) 
therapy for patients with CEA-positive metastatic 
colorectal cancers (24). Data from all of these stud- 
ies demonstrated the safety and feasibility of CAR-T 
therapy for a variety of cancers. 

What can be done to provide patients with greater 
clinical benefits following CAR-T treatment? To 
answer this question, Chinese research groups have 
done some pioneering work. For example, cancer 
stem cells (CSCs) are believed to play an important 
role in tumor growth and metastasis. Theoretically, 
CAR-T therapy targeting CSCs might effectively 
constrain tumor metastasis. The antigen CD133 is 
thought to be a specific marker for CSCs, and is 
also expressed by various carcinomas including 
cholangiocarcinoma (CCA), a cancer of the bile 
ducts. A 2017 article reported on a 52-year-old 
patient with advanced CCA who was treated with 
CART-133 after seeing a partial remission following 
CART-EGFR treatment. A further 4.5-month partial 
remission was observed (25). Although the killing of 
CSCs by CART-133 was not conclusively shown, the 
novel sequential use of the CAR-T cocktail indicated 
intriguing possibilities. Epithelial cell adhesion 
molecule (EpCAM)—a transmembrane glycoprotein 
involved in epithelial cell adhesion—is another CSC 
marker, and several preclinical studies on CART- 
EpCAM cells have been published in China (26, 
27). Currently, four Chinese CART-EpCAM trials are 
underway. 

CAR-T research in China has been developing 
rapidly in recent years. However, we cannot overlook 
the gap in domestic research relative to that seen 
internationally. 

For example, although CAR-T treatment for he- 
matological malignancies is efficient, researchers in 
the United States continue to make improvements. 
A group at the Fred Hutchinson Cancer Research 
Center in Seattle, Washington has been working 
to optimize the composition of CAR-T treatments. 
Clinical data has indicated that optimizing the types 
of T cells used (e.g., including higher percentages 
of naive or central memory T cells) or utilizing a 
defined ratio of CD4-positive to CD8-positive T cells 
gives better clinical results (28, 29). Along these 
lines, Xinqiao Hospital in Chongqing has registered 
a CART-19 clinical trial that will use a CAR-T therapy 
in which a higher percentage of memory T cells will 
be used. A case report from Carl June’s group at the 
University of Pennsylvania demonstrated the possi- 
bility of treating MM with CART-19 cells, something 
previously considered to be impossible (30). Such 
innovative attempts to subvert traditional ideas rarely 
occur in China. 

For CAR-T cell preparation, the vast majority of domestic 
clinical trials use viral transduction. In contrast, 
the University of Texas MD Anderson Cancer Center 
has been promoting the use of the so-called “sleep- 
ing beauty” system, a synthetic DNA transposon 
vector designed to introduce DNA sequences into 
chromosomes following electroporation (31). These 
scientists believe this system can better meet future 
demands for lower cost and greater gene-editing 
capabilities. 

The challenges related to solid tumor treatment 
have motivated researchers in the United States to 
develop new strategies and clinical applications 
that have been rapidly translated into the clinic. 
One such strategy was developed by a group from 
City of Hope National Medical Center (CHNMC) 
in Duarte, California. In order to overcome 
the immunosuppressive microenvironment, 
they designed a PD-1:CD28 chimeric 
receptor, converting an inhibitory receptor to 
a costimulatory receptor by exchanging the 
transmembrane and cytoplasmic protein domains 
of PD-1 with those of CD28. This PD-1:CD28 
chimera exhibited enhanced functional attributes 
and demonstrated a novel strategy to overcome 
PD-L1-mediated immunosuppression (32). Another 
new strategy was developed by a group from 
MSKCC to avoid “on-target, off-tumor” toxicities; 
the group designed an antigen-specific inhibitory 
CAR that could constrain the activation of T cells 
when a specific antigen was encountered (33). 

To reduce the risk of off-target toxicity, affinity- 
tuned CARs that could more precisely distinguish 
tumor cells from normal cells were tested by 
Carl June’s group (34). Unlike in the United 
States, original research in this area is lacking in 
China. However, it is gratifying to note that some 
innovative clinical trials have also been initiated 
there. For example, six trials in which CAR-T cells 
will be modified to express soluble PD-1 or CTLA-4 
antibodies to overcome the immunosuppressive 
microenvironment have been registered. 

It is true that clinical trials in the United States are 
more varied than those in China; China’s clinical 
trials are relatively elementary and mostly similar 
to one another. Yet the diversity of U.S. trials has 
brought us valuable insights into how to design 
effective CAR-T treatments. For example, original 
U.S. studies have illuminated the benefits of using a 
preconditioning regimen with the drugs fludarabine 
and cyclophosphamide to improve the proliferation 
and persistence of CAR-T cells, and have shown how 
to optimize CAR-T preparation protocols and the 
framework of the CAR molecule. 

Compared with traditional drugs, CAR-T therapy 
offers more opportunity for businesses—especially 
small companies—because of the great diversity 
of targets and the many possible ways to optimize 
CAR molecular structures and preparation methods. 
American businesses have also made outstanding 
contributions to the industrialization of CAR-T 
therapy. Of the seven ongoing phase 2 CAR-T 
trials, six are being conducted by three companies 
(Kite Pharma, Juno Therapeutics, and Novartis). 
However, in China, most CAR-T trials are led by 
institutions or hospitals. Business participation has 
been mainly limited to providing technical support 
or supplying viruses. The most effective way to 
encourage domestic enterprises to participate in 
the industrialization of CAR-T therapy is a topic 
deserving further discussion. 

The U.S. FDA has issued clear guidelines and regulations 
for clinical application of CAR-T therapy after 
extensive input from experts in relevant fields. In 
China, no formal regulations on CAR-T therapy have 
yet been issued. This policy uncertainty is a cause for 
concern for businesses. Achieving consensus within 
the field and deepening communication with regu- 
lators has become a top priority for the industry. In 
recent years, several related associations and confer- 
ences have been established, including the Chinese 
Research Hospital Association (CRHA), the China 
Biotherapy Conference (CBC), and the China Dis- 
ease Biotherapy Conference (CDBC). Meetings over 
the past few years have effectively promoted com- 
munication, cooperation, consensus building, and 
the standardization of clinical trials. It is gratifying 
that the regulatory authorities have issued a policy 
draft on CAR-T therapy, and that formal regulations 
are expected by the end of 2017. We believe that 
CAR-T therapy will be regulated as a novel drug type 
in China as it is in the United States. Furthermore, 
the government has been actively supporting CAR-T 
therapy; last year, it announced 14 key research proj- 
ects, and CAR-T therapy was among them. 


The future of CAR-T therapy 

The side effects of a CAR-T cell infusion can in- 
clude serious consequences, even death. Without 
proper control, serious clinical accidents could cause 
significant human and financial losses. For example, 
a phase 2 CART-19 trial sponsored by Juno Ther- 
apeutics was terminated this year because three 
patients died from cerebral edema. It is therefore 
very important to establish more clinical teams with 
experience in immunotherapy. And it is strongly rec- 
ommended that the businesses involved in produc- 
ing the therapies establish close cooperation with 
experienced clinical teams who can help reduce the 
risk of clinical accidents. 

The preparation of CAR-T cells is complicated 
and greatly affected by variation among patients. 
This places a heavy burden on the preparation 
technologies, which may be an obstacle for some 
companies wishing to carry out CAR-T clinical trials. 
To address this problem, some domestic enterprises 
have begun to develop automated and standardized 
CAR-T preparation platforms. For example, 
Sinobiocan (Beijing) Biology Technology is building 
a one-stop CAR-T preparation apparatus, including 
an automated cell-preparation platform, and is also 
providing a steady supply of reliable reagents. As 
an alternative, “off-the-shelf” universal CAR-T cells 
can be prepared using the CRISPR/Cas9 system and 
other gene-editing technologies. Currently, there are 
two clinical trials utilizing universal CAR-T technology 
ongoing in China. 

The deficiencies mentioned above point to a clear 
path for improvements to the current status of CAR-T 
research in China. We believe that this powerful form 
of cancer treatment will undergo rapid development 
with the Chinese government's active support and 
with our joint efforts. 


Precision cancer medicine 
and immunology in China 


Xu-Chao Zhang and Yi-Long Wu 


Precision medicine in the context of clinical 
oncology treats patients by devising therapies 
based on information about genetic and epigenetic 
changes to the genes in their tumors. Canonical 
examples of such mutated genes include 
epidermal growth factor receptor (EGFR) and 
anaplastic lymphoma kinase (ALK) genes in lung 
cancer, human epidermal growth factor receptor 
2 (HER2) in breast cancer, and the BCR-ABL gene 
fusion in chronic myeloid leukemia (CML), among 
many others. The term “precision medicine” 
incorporates personalized medicine, since in a 
clinical setting it implies choosing the right drug 
for the right molecular target in the right patient. 
Precision medicine has dramatically changed 
clinical practice and is further shaping the 
landscape of oncology treatment standards. 
Chinese investigators have contributed 
significantly to the establishment of standard 
treatments for non-small-cell lung cancers 
(NSCLCs) in which the EGFR and ALK genes 
are mutated. In 2009, the world’s first phase 3 
study was carried out showing the efficacy of 
first-generation EGFR tyrosine kinase inhibitor 
(TKI) drugs in patients whose tumors had a 
mutated EGFR gene—a landmark in the study and 
treatment of lung cancer (1). Since then, there 
have been further studies of second- and third- 
generation EGFR TKIs in China (2-5). In addition 
to EGFR, several trials of drugs targeting ALK, 
ROS1, and other so-called “driver genes” for 
lung cancer have generated good evidence for 
the establishment of new standards of care for 
patients with specific genomic alterations. This evidence 
has also supported approval by the China Food 
and Drug Administration (CFDA) of new drugs to 
be marketed in China (6, 7). Research into immune 
therapies has also been making rapid progress, 
and some have already been approved for use in 
the United States and other countries. Early clinical 
trials of these therapies in China are coming 
to fruition, and it is hoped that the resulting 
treatments will be approved by the CFDA in the 
near future. This area is very competitive in China 
because of the number of similar drugs available 
internationally; several local antibody-based drugs 
are entering the arena, although they are still only 
in early trials. 

Progress in precision medicine has already 
changed the practice of oncology in China, which 
is seeing broader use of genotyping and TKI- 
based treatments. Nonetheless, the availability of 
targeted therapies is affected by several factors 
and could benefit from certain improvements. For 
example, the processes by which Chinese patients 
are selected to receive immunotherapy need both 
regulatory and clinical consensus. 

In this article, we will talk about the current state 
of precision medicine in China, specifically in the 
treatment of lung cancer. We will also address 
future directions and strategies in this area. 


Precision medicine in lung cancer treatment 

Lung cancer is a highly heterogeneous group 
of diseases. In Chinese lung cancer patients, the 
major driver genes (including EGFR, ALK, ROS1, 
RET, MET, HER2, BRAF, and KRAS) are very similar 
to those listed in the guidelines published by 
the National Comprehensive Cancer Network 
(NCCN), an alliance of American cancer centers 
(8, 9). However, there are notable differences. The 
EGFR mutation rate, for instance, is around 35% 
in Chinese patients, and only 15% in Caucasian 
patients, while the KRAS mutation rate is around 5% 
to 10% in Chinese lung cancer patients versus 
30% in Caucasians. There is no obvious difference 
in prevalence between populations for other 
mutated genes that occur with lower frequency 
(Figure 1, next page). 

EGFR mutations were identified as a focus for 
targeted therapy for lung cancer in 2004. With 
the first report of EGFR mutations in the Chinese 
population and preliminary data about the 
response rate to TKIs, Chinese investigators Tony
Mok, Yi-Long Wu, and others led a milestone 
phase 3 randomized study, named the Iressa Pan- 
Asia Study (IPASS), published in the New England 
Journal of Medicine in 2009 (1). The study showed 
for the first time that an EGFR TKI was effective 
in treating EGFR mutant cancers. Based on IPASS 
and follow-on studies, EGFR TKI was established 
as a first-line standard treatment for patients with 
EGFR mutated tumors, and was incorporated into 
the treatment guidelines of numerous international 
cancer organizations including NCCN, the 
American Society of Clinical Oncology (ASCO), 
and the European Society for Medical Oncology
(ESMO). In 2007, oncologist Yi-Long Wu, who is 
also on the board of directors of the International 
Association for the Study of Lung Cancer (IASLC) 
and president of the Chinese Society of Clinical 
Oncology (CSCO), founded the Chinese Thoracic 
Oncology Group (CTONG). CTONG now has 31 
member hospitals across China and has become 
an important platform for organizing trials and 
translational research studies. It contributed 
significantly to the development of targeted 
therapies in China, not only participating in several 
international synchronized trials, but also taking 
the leading role in dozens of national studies. 

All the above studies bolstered the registration 
and approval of targeted therapies in China as well 
as the capacity for designing and organizing trials. 
The CFDA has now approved three first-generation
EGFR TKIs (gefitinib, erlotinib, and icotinib), one
second-generation EGFR TKI (afatinib), and a third-
generation EGFR TKI, osimertinib, for EGFR mutant
tumors. Crizotinib, aimed at Chinese patients who
have ALK or ROS1 mutations, was also approved by
the CFDA. Other ALK TKIs are in clinical trials led
by investigators from CTONG, while TKIs targeting
numerous other mutated genes are in different
stages of clinical trials or already in use in clinical
settings.


Novel trial design for oncology 

A pragmatic question faces developers of 
targeted therapies: How can they ensure that a 
sufficient number of patients with specific genetic 
alterations are enrolled in the clinical trials for each 
drug? The idea of simultaneously screening pa- 
tients for a set of genetic biomarkers linked to the 
disease and matching the drugs under investiga- 
tion to each specific genetic or pathway alteration 
has now been adopted worldwide. 

Yi-Long Wu, as the president of CSCO, has 
advocated this kind of trial design for many years, 
attempting to push molecular genotyping and 
targeted therapies into clinical practice. In 2015, 
Wu launched the CLUSTER 1.0 trial, which is the 
first multiple-biomarker-driven clinical research 
study to be conducted in Asia (10). The enrolled 
patients were assigned to groups receiving 
different agents according to their genetic 
profiles. Five agents targeting different molecular
abnormalities are being studied based on the
mutation(s) found, making up the five arms of the
study. Each arm will be analyzed independently.
If the response rate to an agent reaches 40%, a
phase 2 trial will be conducted. This multiple-
arm, parallel assignment is intended to reduce
the overall drug screening failure rate. This
study drew much attention from the Center for
Drug Evaluation of the CFDA, and is a significant
milestone for precision medicine in China. Design
of the follow-up CLUSTER 2.0 lung cancer trial
is complete, and the trial is scheduled to launch in
the fourth quarter of 2017. CLUSTER 2.0 is also
biomarker-driven and will evaluate novel targeted agents in at
least 10 study arms focused on specific genetic or 
pathway alterations. 
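
The multi-arm, biomarker-driven assignment described for CLUSTER 1.0 can be pictured with a short Python sketch. It is an illustration only: the alteration labels, the arm names, and the patient records are hypothetical and are not taken from the trial protocol. Only two points come from the text: each arm is analyzed independently, and a 40% response rate triggers a phase 2 trial.

```python
from collections import defaultdict

# Hypothetical mapping from a detected alteration to a study arm.
ARM_FOR_ALTERATION = {
    "ALT_A": "arm_1",
    "ALT_B": "arm_2",
    "ALT_C": "arm_3",
}
RESPONSE_THRESHOLD = 0.40  # from the text: 40% response rate leads to phase 2


def assign_arm(alterations):
    """Return the arm for the first matching alteration, or None if none match."""
    for alteration in alterations:
        arm = ARM_FOR_ALTERATION.get(alteration)
        if arm is not None:
            return arm
    return None


def arms_to_advance(patients):
    """Analyze each arm independently and list the arms that meet the threshold."""
    enrolled = defaultdict(int)
    responders = defaultdict(int)
    for patient in patients:
        arm = assign_arm(patient["alterations"])
        if arm is None:
            continue  # no matching arm, so the patient is not enrolled
        enrolled[arm] += 1
        responders[arm] += int(patient["responded"])
    return sorted(
        arm for arm in enrolled
        if responders[arm] / enrolled[arm] >= RESPONSE_THRESHOLD
    )
```

Because every patient is screened once against all arms, a patient who fails to match one agent can still be routed to another, which is how a design of this kind is meant to lower the overall screening failure rate.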

During the process of developing new antican- 
cer agents such as icotinib, avitinib, and apatinib, 
Chinese researchers have already had experience 
with innovative clinical trial design. One example is 
a phase 1/2 study that aims to determine the safety 
profile and recommended phase 2 dose (RP2D) 
of avitinib in patients carrying a particular EGFR 
mutation. Instead of using only the maximum toler- 
ated dose, as is generally done, in this study RP2D 
was determined by the pharmacokinetics, prelim- 
inary efficacy, and safety profile of the agent. This 
innovative design established a new paradigm 
that allows the testing of a wider range of dosages 
showing indications of efficacy and low toxicity at 
an early stage. Based on the expedited determi- 
nation of RP2D, in 2016 researchers applied for 
approval from the CFDA to conduct a phase 2/3 
trial with avitinib—the fastest-ever application for a 
novel agent in China. 

Currently, the CFDA has approved only five EGFR
TKIs (gefitinib, erlotinib, icotinib, afatinib, and
osimertinib) and one ALK/ROS1 TKI (crizotinib).
No drugs are clinically available for lung cancer
patients with other common mutations. Therefore,
enrolling in a new trial offers such patients a
way to gain access to precision medicine. Indeed,
through these ongoing innovative clinical trials, 
patients with actionable genomic alterations can 
quickly access drugs under study internationally, 
as well as drugs approved in the United States, 
Japan, or Europe (but not in China). 


Guidelines for biomarker testing, diagnosis, 
and treatment 

In precision oncology, molecular diagnosis
precedes precise treatment. EGFR and ALK testing,
and their corresponding treatments, have been
included in the guidelines of numerous cancer
associations, including NCCN, ASCO, and ESMO.
IASLC has released the EGFR and ALK testing
atlases, a series of consensus documents on molecular
testing (11).

In China, local guidelines for different tumor 
types have been formulated. Since 2016, CSCO 
has taken the lead in compiling the annual Guide- 
lines for Diagnosis and Treatment for patients with 
lung and other cancers. The National Health and
Family Planning Commission, formerly the Ministry
of Health, also organized national experts to
produce Clinical Path and Guidelines for Diagnosis 
and Treatment for Primary Lung Cancer Patients. 
Yi-Long Wu played a leading role in compiling 
both of these guidelines as they pertain to lung 
cancer. In the CSCO guidelines, due to the limited 
availability of drug and testing technologies, and 
disparities and imbalances in resources distri- 
bution, only EGFR mutations and ALK and ROS1 
rearrangements and their targeted drugs are listed 
for routine clinical practice. Circulating tumor 
DNA from plasma is regarded in the report as an 
alternative biomaterial to direct tumor sampling 
for screening of EGFR mutations, but its potential 
for producing false negatives is emphasized. The 
national and local guidelines for testing, diagnosis, 
and treatment are modified annually to account for 
progress and new evidence from trials. 

Several EGFR and ALK gene testing kits have 
been approved by CFDA and are also recommend- 
ed by national or local guidelines. According to 
one as-yet unpublished report, the overall rate of 
testing for EGFR mutations in patients in China 
is around 50%, which, although higher than the
previous rate, is still not on par with the rate in
developed Western countries, Japan, Korea, and 
Singapore. 

Many leading hospitals carry out molecular testing
in their own laboratories under license from the
National Health and Family Planning Commission,
using CFDA-approved kits or tests developed
in-house. Some hospitals, especially those with fewer
resources, outsource molecular testing to third-
party laboratories. As sophisticated next-generation
sequencing (NGS) technology comes into wide use,
many companies are reportedly emerging
across the country to deliver that service. The CSCO
biomarker committee, led by Yi-Long Wu and
Xu-Chao Zhang, is working
to standardize the clinical use of NGS technology
and to ensure testing quality for clinical oncology
practice. In a draft of Consensus of the Use of NGS
in Clinical Oncology, to be released in the near
future, NGS technology is regarded as a complex
genotyping tool for which quality control is critical.
In addition, diagnostic kits based on NGS used to
stratify patients with specific gene mutations are
currently being developed by many companies.
They will be validated as companion diagnostic
tools in the CLUSTER 2.0 trial.

For patients with driver genes or genetic 
alterations other than EGFR, ALK, or ROS1, the
CSCO guidelines recommend participation in
trials in which investigational drugs are provided.
Patients with disease relapse or those who are
resistant to the available TKIs are encouraged to
undergo additional molecular screening to clarify 
the resistance mechanism and to participate in 
clinical trials pertinent to their diagnosis. 

Precision medicine in China has made 
substantial progress in the treatment of lung 
cancers harboring EGFR, ALK, and ROS1 
alterations. Chinese investigators have
contributed to the global and national
establishment of standard targeted therapies. Yet
if precision oncology is to advance, more effort is
needed to improve the detection rate of key driver
genes like EGFR, to develop new drugs, to launch
novel trials, and to standardize molecular testing.


Immunology of cancer 

Internationally, immune therapy has been 
recognized as a potentially powerful treatment 
since the establishment of the field of immuno- 
oncology more than 100 years ago. In the United 
States, Europe, and Japan, immune checkpoint 
inhibitors—such as drugs that target CTLA-4, PD-1, 
and PD-L1, proteins known to inhibit T cells from 
fighting cancer—have already been approved 
by regulatory agencies for a variety of cancer 
types including melanoma and lung cancer. 
By the end of December 2016, the U.S. Food 
and Drug Administration (FDA) had approved 
therapies that block the PD-1 pathway, including 
nivolumab, pembrolizumab, and atezolizumab. 
The working principle of this kind of inhibitor is 
to remove the brakes on immune activation or 
block the inhibition of immune attack in a tumor’s 
microenvironment. Antibodies against other 
immune checkpoints like TIM3 and IDO are in the 
preclinical or clinical stage of development. 


Immune therapy for lung cancer 

In China, no PD-1 or PD-L1 inhibitors have been 
approved by CFDA. The contrast with the United 
States indicates that Chinese R&D in
immuno-oncology biologics (protein-based drugs) lags
behind, as it does in the area of small-molecule 
drugs. However, various clinical trials are actively 
investigating both international and domestic 
drugs in Chinese patients with a range of cancer 
types including lung cancer. 

Between January 1, 2013 and April 6, 2017, 
ClinicalTrials.gov registered 270 international 
clinical trials of PD-1/PD-L1 therapies for NSCLC, 
including trials for nivolumab, pembrolizumab, 
atezolizumab, and durvalumab. These 270 trials 
included 61 studies that involved East Asian sites 
and 14 that involved Chinese sites; of these 14,
12 were multinational trials and 2 evaluated only
Chinese patients (12).

In the 14 international trials that enrolled 
Chinese patients, six involve first-line/primary 
therapies, four involve second-line therapies 
(used if primary treatments fail), two are studies of 
adjuvant therapies, and two are phase 1 studies 
using only Chinese patients. There are five studies 
for atezolizumab, four for durvalumab, three 
for pembrolizumab, and two for nivolumab. It is 
noteworthy that PD-L1 expression was set as a 
criterion for recruitment in some trials, but not in 
others, and that different diagnostic antibodies 
and screening platforms were used in this 
determination. 

Nonuniform PD-L1 expression testing and 
cutoffs make it extremely difficult to compare 
results from different trials. Nonetheless, it 
is encouraging that Chinese pharmaceutical 
companies are working intensively to develop 
PD-1 and PD-L1 drugs. With the efforts of CFDA 
to reform the regulatory framework for clinical 
trial approval of novel agents and to adjust its 
policies to respond to advances from international 
clinical trials, immunotherapy trials are now 
getting underway faster. Chinese pharmaceutical 
companies had developed eight PD-1 or PD-L1 
inhibitors by January 7, 2017, and four drugs have 
been approved by CFDA for phase 1 trials among 
patients with advanced solid tumors including 
NSCLC. Another four drugs are now being 
considered by CFDA for clinical trial approval. 

In November 2016, the PD-1 inhibitor KN035,
which is administered by subcutaneous injection, 
received approval from the U.S. FDA for clinical 
trials. 

Notably, the immuno-oncology market is very 
competitive for domestic Chinese pharmaceutical 
companies. As of October 2017, there are 17 
inhibitors targeting immune checkpoints PD-1, 
PD-L1, or CTLA-4 from 14 domestic corporations
treating a range of cancers (Table 1).

Company                         MoA      Drug/Asset
Jiangsu Hengrui Medicine        PD-1     SHR-1210
                                PD-L1    SHR-1316
Shanghai Junshi Bioscience      PD-1     JS001
Innovent Biologics              PD-1     GB226
BeiGene                         PD-1     BGB-A317
Genor BioPharma                 PD-1     IBI308
3DMed                           PD-L1    KN035
                                CTLA-4   KN044
Harbin Gloria Pharmaceuticals   PD-L1    GLS-010
CStone Pharmaceuticals          PD-L1    Anti-PD-L1
Akeso Biopharma                 PD-1     AK101, AK103
Bio-Thera                       PD-1     Anti-PD-L1
Livzon Pharmaceutical           PD-1     Anti-PD-L1
Lee’s Pharma                    PD-1     Anti-PD-L1
Sichuan Kelun                   PD-L1    KL-A167
Shanghai Henlius Biotech        PD-1     Anti-PD-L1

TABLE 1. Summary of domestic pharmaceutical companies with R&D in
immuno-oncology. Source: www.chinadrugtrials.org.cn. MoA, mechanism
of action.


Improving patient selection for immune
checkpoint inhibitor therapy

PD-L1 protein expression on tumor cells or
immune cells in the tumor microenvironment has been
widely used as a companion diagnostic, or
evaluated retrospectively in clinical trials. The overall
response rate to PD-L1 inhibitor therapy in patients
selected for using PD-L1 expression levels increased
from 20% to 40%-60%, a smaller increase than
expected (13). However, testing for PD-L1 protein
expression by immunohistochemistry (IHC) was
performed using different platforms, antibodies,
and scoring methods, making it difficult to compare
results from different trials. Hirsch and colleagues
reported that in one of the studies, the
discrepancies caused by testing for PD-L1 on different
platforms could result in misclassification of PD-L1
status (14).

In China, no PD-L1 protein expression tests or kits
have been approved by CFDA. No
PD-L1 test is recommended in the
CSCO guidelines for any cancer type.
In contrast, a PD-L1 IHC test has been
approved by the U.S. FDA. In ongoing
trials in China, a PD-L1 test is taken as
one criterion for inclusion in certain
trials but not in others. Whether
PD-L1 should be used as a biomarker
to select Chinese patients for PD-L1
inhibitor treatment requires further
investigation.

Are there any potential biomarkers
other than PD-L1 to be developed
for the Chinese population?
Theoretically, molecules in the
microenvironment of a tumor and
in the circulating blood could
potentially be predictive markers.
Such molecules could be involved in
any one of the many steps between
the appearance of an antigen and
the launch of an immune attack on
tumor cells. Biomarkers such as tumor
mutational burden, DNA mismatch
repair status, high microsatellite
instability, and circulating T-cell
characteristics are under extensive
investigation in trials using several
cancer types.

In ongoing clinical trials in
China, investigators have a good
opportunity to study these
biomarkers by testing biomaterials
collected from tumor and blood
samples. In one study (with the
acronym CHOICE) led by Yi-Long Wu,
researchers looked at the immune
scores of 250 heavy smokers with
lung cancer by analyzing the RNA
sequencing data of the immune cells
isolated from the tumors. Preliminary
data showed a correlation between
immune score and disease prognosis (unpublished
data). This RNA-sequencing-based method can be 
further evaluated in clinical trial patients to see if 
the immune scores can be predictive for positive 
response to immunotherapy. It is important to 
note that some biomarkers require sophisticated 
technologies like multicolor IHC, whole-exome 
deep sequencing, or the use of cancer gene panels. 
One other challenge is how to integrate several 
different kinds of immune-related biomarkers, for 
example, combining tumor mutational burden with 
PD-L1 expression. 
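
The integration problem mentioned above can be pictured with a toy rule that combines two markers. The Python sketch below is purely illustrative: the field names, the cutoff values, and the "either marker positive" rule are assumptions made for the example, not validated clinical criteria and not a method used in the trials discussed here. It also shows how the same sample can be called positive or negative for PD-L1 depending on the cutoff chosen.

```python
from dataclasses import dataclass


@dataclass
class Sample:
    pd_l1_percent: float  # percentage of tumor cells staining for PD-L1 by IHC
    tmb_per_mb: float     # tumor mutational burden, mutations per megabase


def pd_l1_positive(sample, cutoff_percent):
    """Classify PD-L1 status against a caller-supplied cutoff."""
    return sample.pd_l1_percent >= cutoff_percent


def composite_positive(sample, pd_l1_cutoff=50.0, tmb_cutoff=10.0):
    """Hypothetical rule: positive if either marker meets its assumed cutoff."""
    return (
        sample.pd_l1_percent >= pd_l1_cutoff
        or sample.tmb_per_mb >= tmb_cutoff
    )
```

With these assumed defaults, a sample at 20% PD-L1 staining and 12 mutations per megabase is PD-L1 positive at a 1% cutoff, PD-L1 negative at a 50% cutoff, and positive under the composite rule. This is the kind of disagreement that makes harmonized cutoffs and scoring methods necessary before markers can be combined.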

Overall, the development of novel biomarkers to 
predict and guide immune therapy using PD-1/ 
PD-L1 inhibitors needs further exploration, par- 
ticularly in trial patients who can provide ongoing 
samples and whose treatment and follow-up data 
can be monitored. 


Summary and future directions 

Over the past 10 years, the practice of clinical 
oncology in China has gradually incorporated the 
attributes of precision medicine, as is happening 
in other countries. There has also been significant 
progress in clinical research and drug development. 
Chinese investigators have contributed in important 
ways to international studies, and have conduct- 
ed trials based on the particular characteristics of 
Eastern populations. A series of novel targeted 
agents with good efficacy has been synthesized and 
introduced for clinical use. In the arena of immune 
therapy, dozens of national and international trials 
are underway in the hope of moving PD-1/PD-L1 
inhibitors into clinical use. Eight such inhibitors from 
local Chinese companies are in the early stages of 
clinical development for use against a variety of 
cancer types. A handful of other small-molecule
TKIs and immune checkpoint inhibitors are undergoing
validation in clinical trials. National and local
guidelines for clinical practice have been written in 
recent years to standardize the use of these effec- 
tive new treatments. 

Nonetheless, China is lagging behind in some 
areas, including development of novel drugs,
availability of TKIs and antibody drugs that have already
been approved in other countries, development of 
companion diagnostics, and availability of clinical 
biomarker tests. Furthermore, many domestic novel 
anticancer agents are structural modifications of 
agents that are already on the market or are under- 
going research; innovative design is rare. 

However, there are many possible future direc- 
tions. The pharmaceutical industry should shift its 
focus from generic agents to the development of 
pioneering agents. New technologies like NGS and
liquid biopsies of ctDNA can be effective ways to
push innovative biomarker-driven trials forward,
especially for those patients with rare genetic
alterations in cancer-related genes. Attention
should also be paid to the mechanisms that cause
resistance to EGFR and ALK TKIs, and to drugs that
can overcome this resistance. Immunotherapies,
including checkpoint inhibitors and adoptive cell
therapy, need more translational study. Finally,
development of biomarker companion diagnostics
and standardization of biomarker tests in a clinical
setting should be addressed.

It is well known that much effort is needed to 
improve drug development by local companies, as 
well as the competitiveness of those companies. 
The Chinese government has policies to encourage 
innovative R&D on new drugs and to incubate pio- 
neering startups in China. Influenced by the Preci- 
sion Medicine Initiative declared by former Amer- 
ican President Obama and the Cancer Moonshot 
Initiative led by former Vice President Biden, China 
initiated a set of national science and technology 
programs designed to stimulate R&D on precision 
medicine in oncology. 

To further drive precision oncology forward, all 
stakeholders, including the government, academic 
investigators, pharmaceutical companies, innovative 
biotechnology companies, patients, and disease 
advocates should come together and bring their 
resources to bear. Only in this way will we be able to
bring precision medicine to the patients who need 
it most. 
The explosive growth of medical
and 'omics big data 

The biomedical field is experiencing an era of sig- 
nificant growth due to the large amount of clinical, 
biomolecular, and health data being collected. The 
increased adoption of electronic medical record 
systems worldwide has enabled large volumes 
of clinical data to be captured and stored. These 
data include screening, diagnostic, and treatment 
results as well as medical and family histories, 
and data from biochemical testing, imaging, and 
follow-ups. Furthermore, advanced technologies 
that collect biomolecular data from studies in ge- 
nomics, transcriptomics, proteomics, epigenomics, 
metabolomics, metagenomics, and exposomics— 
collectively termed “omics”—can provide results 
within hours or days, dramatically speeding up clinical
analysis. “Multi-omics” molecular data are widely
used in the field of precision medicine and related 
clinical applications. These data constitute one of the 
fastest-growing data types in biomedicine. More- 
over, a large amount of daily biological data related 
to personal health is being continuously collected by 
wearable devices and smartphones. These multidi- 
mensional data require efficient methods of storage 
and analysis if they are to be useful tools for explor- 
ing the mysteries of disease and health. 


Large cohort studies in Chinese 
precision medicine 

As a consequence of major advances in gene 
sequencing technology, programs promoting 
precision medicine have been launched worldwide,
with the aim of improving accuracy in screening, 
diagnosis, intervention, and treatment. Scientists 
around the world, but particularly in the United 
States, the United Kingdom, and China, are 
collecting the genomic data of millions of people 
to establish a database of genetic variants. In 2016, 
the Chinese Ministry of Science and Technology 
founded the Precision Medicine Research Key 
Special Project (PMRKSP) and announced an 
investment of billions of Chinese yuan before 2030. 
At present, three types of large cohort studies
have been launched as part of this effort. The
first is the study of natural national populations,
in which millions of participants from seven main
regions covering four municipalities and 24
provinces of China will be analyzed. The second
is a study of eight cohorts affected by critical
diseases, including cardiovascular, cerebrovascular,
respiratory, metabolic, neurological,
psychosomatic, and immune system disorders
as well as seven common malignant tumors. A
primary goal of this study is to collect samples from
a total of 700,000 people. The third investigation
is a clinical cohort study of 50 rare diseases that
aims to enroll more than 50,000 patients in a
single cohort. In addition, the Chinese Academy
of Sciences (CAS) has announced a precision
medicine research plan targeting the collection of
genetic information from 4,000 Chinese volunteers
within four years.


From big data to knowledge 

The meticulous analysis of big data is essential to
extract accurate information and ultimately convert
it into knowledge (1) that can assist in clinical
decision-making and health management (Figure 1,
next page). In order to support this transformation, 
the Chinese government has developed a series 
of plans that focus on key technologies involved in 
processing large amounts of data. The 13th Five- 
Year National Science and Technology Innovation 
Plan, issued by the State Council, will develop 
technologies for precision medicine, integrate 
them into a multilevel knowledge database, and 
create a national platform to share biomedical 
big data. PMRKSP has also initiated a series of 
research projects that focus on the most important 
technologies for big data integration, storage, use, 
and sharing. The program has taken three steps
toward transforming data into information, and
information into knowledge. First, in 2016, it
began building fundamental data infrastructure
(data management, processing systems, and
disease databases). Second, at the beginning of
2017, scientists began processing the raw data
generated from PMRKSP into multilayered
biomolecular information repositories, which
assist in the integration of detailed standardized
information and metadata, including clinical
patient information and its associated genomic,
proteomic, metabolic, and other 'omics data.
Finally, by the start of 2018, knowledge mining
will begin on the results of the first two stages of
work, with the purpose of discovering important
genetic diseases, key mutation sites, and
epigenetic information. This project will facilitate
the implementation of research in precision
diagnosis and the treatment of disease, and will
lead to important developments in precision
medicine.

FIGURE 1. Research and applications of big data in precision medicine.


Construction of big data platforms 
for precision medicine 

With the support of a national policy concerning
big data platforms, China has been able to construct
a comprehensive infrastructure to successfully
implement precision medicine across both state-
owned and private health care sectors. Under the
guidance of China’s National Health and Family
Planning Commission, a new company known as
China Healthcare Big Data Development Co., Ltd.
(CHBDDC) was established. CHBDDC designated
two provinces and four cities to participate in a
pilot program of national health care big data
centers. The China National GeneBank (CNGB),
whose establishment was approved by the
National Development and Reform Commission,
was launched in 2016 and will store and manage
the country’s unique genetic resources. In
addition to the national big data centers, domestic
biological big data platforms such as BGI Online
(www.bgionline.com) and JingYun (www.gene.ac)
have been built to provide fast and accurate
computing services in bioinformatics. Information
technology cloud service providers, including
Alibaba and Huawei, provide specialized cloud
storage and computing services for biomedical
data, which allow information technology and
biotechnology to be closely integrated.


The potential of medical big data 

Medical and 'omics big data are improving health
care efficiency and quality by transforming data 
into knowledge (2-4). There are three aspects of 
big data that best describe its potential value. 
Firstly, the data can describe the entire process of 
a disease by finding the hidden associations and 
patterns within complex data, uncovering molecu- 
lar mechanisms, revealing biomarkers, identifying 
disease-associated genetic factors, and evaluating 
therapeutic effectiveness. Secondly, the genomic 
and clinical data can be used to predict the risk of 
genetic diseases, the occurrence of cancers, and 
the clinical outcome of treatments aimed at novel 
disease targets. Finally, big data can help doctors 
to prevent disease by prescribing personalized 
health care regimens using multidimensional data 
from individuals as well as real-time health and 
disease information available through the dynamic 
collection of data acquired daily through personal 
health monitoring devices. 


Challenges of big data use in precision 
medicine 


Data sharing 

To maximize the contributions of big data, gen- 
erate reproducible results, and improve research 
practice, it is essential to allow data to be open and 
shared among different institutions (3). PMRKSP
requires that data from clinical studies be
unconditionally shared through a national big data
center. However, a mutually beneficial alliance is 
necessary to encourage research institutions and 
businesses to share key data generated outside of 
PMRKSP. 


Security and privacy of data 

In the future, a large volume of multi-omics infor- 
mation on populations and electronic medical and 
health records will be stored in big data platforms. 
If personal privacy is not guaranteed, ethical issues 
are bound to arise. To avoid privacy disputes, the 
government should formulate strong data protec- 
tion laws. Additionally, methodologies that do not 
reveal an individual's information, such as multipar- 
ty computation, are indispensable and should be 
established (5). 
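
As a rough illustration of the idea behind multiparty computation, the Python sketch below uses additive secret sharing so that several institutions can compute a combined total (for example, the number of carriers of a variant) without any one of them disclosing its own count. The modulus, the function names, and the example counts are assumptions made for this illustration. It is a toy, not a production protocol, and it is not claimed to be the method described in reference 5.

```python
import secrets

MODULUS = 2**61 - 1  # assumed modulus; it must exceed any possible total


def make_shares(value, n_parties):
    """Split value into n additive shares that sum to value modulo MODULUS."""
    shares = [secrets.randbelow(MODULUS) for _ in range(n_parties - 1)]
    shares.append((value - sum(shares)) % MODULUS)
    return shares


def secure_sum(private_values):
    """Sum the parties' values while pooling only per-party partial sums."""
    n = len(private_values)
    all_shares = [make_shares(value, n) for value in private_values]
    # Party j receives the j-th share from every party and adds them up.
    partial_sums = [
        sum(row[j] for row in all_shares) % MODULUS for j in range(n)
    ]
    return sum(partial_sums) % MODULUS
```

Calling `secure_sum([12, 30, 7])` returns 49, yet each individual share is a random number that reveals nothing about the count it came from. A real deployment would also need secure channels between parties and protection against dishonest participants, which this sketch leaves out.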


Standardization of data generation and analysis 
Currently, clinical research based on data from
next-generation sequencing (NGS) lacks standards
and specifications for sample collection, storage,
handling, and analysis. These issues lead to poor
data quality, errors in analysis, and misinterpretation
of results. In 2016, the Chinese Society
of Clinical Oncology released the first draft of
“Consensus on Second Generation Sequencing
Technology Applied in Precision Diagnosis and
Treatment of Clinical Cancer” in order to promote
the standardization of NGS in clinical testing. The
following year, experts from West China Hospital 
of Sichuan University and Peking Union Medical 
College Hospital published the “Expert Consen- 
sus on Second Generation Gene Sequencing in 
Clinical Molecular Pathology Laboratories.” These 
documents will greatly accelerate standardization 
of the generation and interpretation of NGS data. 
However, more informative industry standards are 
needed to ensure the reliability and reproducibility 
of precision medical data. 


Integration of multidimensional 'omics data 

To overcome the deficiencies of the single ‘omics 
approach, it is crucial to introduce redundancy by 
generating multi-omics data that allow information 
on different types of biomolecular interactions to 
be extracted. In addition, expanding the scope of 
the methods used to collect data, such as includ- 
ing data from noncoding DNA regions, will further 
enhance the power of the big data analyses being 
performed. 
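
A first practical step in multi-omics integration is simply lining up the different data layers for the same sample. The Python sketch below shows one minimal way to do that. The layer names, feature names, and the choice to keep only samples present in every layer are assumptions made for illustration, not a description of any platform mentioned in this article.

```python
def integrate_omics(layers):
    """Merge per-sample records from several omics layers.

    `layers` maps a layer name (e.g. "genomics") to a dictionary of
    {sample_id: {feature: value}}. Only samples present in every layer
    are kept, and each feature is prefixed with its layer name so that
    features from different layers cannot collide.
    """
    if not layers:
        return {}
    shared_ids = set.intersection(
        *(set(records) for records in layers.values())
    )
    merged = {}
    for sample_id in sorted(shared_ids):
        row = {}
        for layer_name, records in layers.items():
            for feature, value in records[sample_id].items():
                row[f"{layer_name}.{feature}"] = value
        merged[sample_id] = row
    return merged
```

Keeping only the shared samples is the strictest option. An alternative is to keep every sample and mark missing layers explicitly, which preserves more data but pushes the handling of gaps onto the downstream analysis.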


Deep integration with information technology 
Medical big data raise significant challenges in 
storage and computation, which urgently require 
innovative information technology approaches 
to provide solutions. A key problem associated 
with big-data knowledge mining is the rapid rate 
at which the size and complexity of the informa- 
tion network expands, resulting in bottlenecks in 
computing speed. One resolution to this issue has 
recently been offered by the Chinese technology 
company Hanwuji Intelligence, which improved 
computing speed by using a dedicated proces- 
sor that analyzes data through a type of machine 
learning known as “deep learning” (6). This solution 
suggests that resolving the unique problems asso- 
ciated with medical big data processing will require 
a range of cutting-edge information technology, 
potentially requiring dedicated hardware and soft- 
ware to be embedded in the big data platform. 


Future trends in big data for precision medicine 
Future big data technology will be closely inte- 
grated with artificial intelligence (AI) technology 
to promote the development and application of 
precision medicine. The clinical application of IBM's 
Watson supercomputer has demonstrated that AI 
can enable clinical decisions to be made quickly 
and accurately through the integration and anal- 
ysis of big data. At present, China has insufficient 
medical resources and is facing problems with an 
irrationally structured medical system as well as 
unbalanced distribution of its medical resources. 
This situation makes it difficult for citizens to seek 
medical treatment, as is explained by the State 
Council in its position paper Key Tasks to Deepen 
the Reform of the Medical and Health System in 
2016. To address these issues, the government is 
establishing diagnosis and treatment guidelines, 
strengthening basic services, and promoting better 
allocation of high-quality medical resources for 
Chinese citizens. The intelligent diagnosis and 
treatment system constructed with big data, in con- 
junction with AI, will incorporate the experience of 
experts, effectively enhancing the quality of grass- 
roots services available and addressing the deficit 
in the availability of medical resources in China. 


The role of circulating 
cell-free DNA in the 
management of cancer 
in China 


Ying Hu, Yanhui Chen, Lei Zhang, 
Haitao Zhao, Hui Zeng, and Henghui Zhang 


Clinical needs in the management of cancer 
in China 

Cancer is a major public health problem in China. 
Wanqing Chen and colleagues reported that an es- 
timated 4.9 million new cancer cases and 2.8 million 
cancer deaths occurred in China in 2015, with lung 
cancer being the most common and the leading 
cause of cancer death (1). Antitumor treatments 
currently being developed in China aim to improve 
overall survival and quality of life for patients. 


Some of the most important clinical needs for 
effectively managing cancer include: 

1. Biomarkers that are sensitive and specific to 
particular kinds of cancer cells, enabling earlier 
diagnosis and identification of the tissue of origin 
for unknown metastases and helping scientists 
distinguish between benign and malignant tumors. 

2. Real-time tracking of biomarkers that can assist 
in clinical decision making, predict the safety and 
efficacy of a therapy, monitor the patient's response 
to a therapy, allow early detection of any resistance 
to treatment, and aid in prognosis. 

3. Biomarkers that can define the molecular 
characteristics of a cancer and enable monitoring 
of these characteristics as the cancer progresses or 
shrinks following treatment. 

Cell-free DNA (cfDNA)—nucleic acids not 
bound inside cells but circulating in the plasma— 
was detected in the bloodstream of patients 
with cancer as early as 1948 (2). Clinical studies 
indicated that plasma cfDNA could be used as a 
biomarker for diagnostic screening, predicting 
responses to therapy, monitoring the size of the 
tumor, and diagnosing a relapse at an early stage 
(3-5). 


cfDNA in cancer diagnosis 

Circulating cfDNA as a noninvasive biomarker 
has been used for early diagnosis and monitoring 
of tumor burden (6, 7). Several studies have shown 
that levels of cfDNA are higher in cancer patients 
than in healthy people (8-10). However, the use of 
circulating cfDNA in early cancer diagnosis has some 
limitations, since it is thought to originate from both 
tumor cell DNA and normal cell DNA. In early stage 
tumors, the low abundance of tumor-derived cfDNA 
may make detection difficult. Somatic mutations— 
nonheritable genetic alterations—found in cfDNA 
may provide information to help distinguish benign 
from malignant tumors. In practice, some variants 
identified in gene tests are difficult to classify clinical- 
ly as pathogenic or benign, making it challenging to 
clinically distinguish benign from malignant tumors. 


Using cfDNA to identify the tumor origin 

Recent studies have demonstrated that epigen- 
etic information embedded in cfDNA can provide 
information useful for predicting a tumor’s tissue 
of origin (11, 12). Guo and colleagues performed a 
systematic search of tissue-specific methylation re- 
gions across the entire human genome. Using a ref- 
erence database of known methylation signatures, 
the authors identified tissue-specific methylation 
markers and demonstrated accurate determination 
of tissue of origin from cfDNA from patients with 
lung or colorectal cancer (11). 


Using cfDNA in selecting patients 
for targeted therapy 

cfDNA has been used in clinical practice to 
stratify non-small cell lung cancer (NSCLC) patients 
for targeted therapy. The U.S. Food and Drug 
Administration (FDA) and the European Medicines 
Agency (EMA) have approved the use of epidermal 
growth factor receptor (EGFR) mutations detected 
in circulating cfDNA to select NSCLC patients 
with EGFR-actionable mutations for EGFR tyrosine 
kinase inhibitor (EGFR-TKI) therapy. However, this 
presents challenges for clinical decision-making, 
including the detection of discordant mutations 
in the tumor tissue compared to circulating 
tumor DNA (ctDNA). A large retrospective study 
of Chinese NSCLC patients showed that among 
2,463 NSCLC patients with matched tumor tissue 
(T) and ctDNA (C) specimens, 1,017 patients 
carried the EGFR mutation in tumor tissues and/ 
or ctDNA. Of these patients, 472 received EGFR- 
TKI treatment and were divided into three groups 
based on whether they carried the mutation in the 
tumor, in circulating cfDNA, or both: T+/C+ (n=264), 
T-/C+ (n=28), and T+/C- (n=180). The median 
progression-free survival was similar across the 
groups. Further PCR and next-generation 
sequencing (NGS) validation from microdissected 
surgical specimens suggested that intratumor 
heterogeneity and relatively low sensitivity of 
the mutation detection assay contributed to 
discordant EGFR mutant status between tissues 
and ctDNA. In fact, neither tissue nor ctDNA 
analysis covers all EGFR mutations, suggesting 
that a combination of data from both tissue and 
ctDNA might be needed to accurately detect the 
presence of EGFR mutations. This study suggested 
that primary screening using ctDNA, followed 
by tumor tissue analysis in the case of a negative 
result, may be a preferable approach to determine 
whether EGFR-TKI therapy is warranted (13). 
Previous studies have detected RAS mutations in 
ctDNA, using this information to stratify metastatic 
colorectal cancer (mCRC) patients for cetuximab or 
panitumumab treatment (4, 14). Recently, Xu and 
colleagues found a novel phosphatidylinositol- 
4,5-bisphosphate 3-kinase catalytic subunit alpha 
(PIK3CA) mutation (p.K944N) in patients with 
resistance to cetuximab. These results indicated 
that patients harboring PIK3CA or RAS mutations 
in ctDNA showed shorter progression-free survival 
than those with wild-type sequences (15). 
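
The group sizes reported for the EGFR-TKI-treated cohort make the tissue/ctDNA discordance easy to quantify. The Python sketch below uses only the three counts given above; the function and variable names are illustrative, and the resulting percentages describe the 472 treated patients only, not the full cohort of 2,463.

```python
# Counts of EGFR-TKI-treated patients by where the EGFR mutation was detected,
# as reported in the text: T = tumor tissue, C = ctDNA.
groups = {("T+", "C+"): 264, ("T-", "C+"): 28, ("T+", "C-"): 180}


def detection_summary(counts):
    """Return the share of mutation-positive patients found by each sample type."""
    total = sum(counts.values())
    tissue_pos = sum(n for (t, _), n in counts.items() if t == "T+")
    ctdna_pos = sum(n for (_, c), n in counts.items() if c == "C+")
    both = counts.get(("T+", "C+"), 0)
    return {
        "total": total,
        "tissue_fraction": tissue_pos / total,
        "ctdna_fraction": ctdna_pos / total,
        "concordant_fraction": both / total,
    }


summary = detection_summary(groups)
for key, value in summary.items():
    print(key, value if key == "total" else f"{value:.1%}")
```

With these counts, ctDNA alone identifies about 62% of the treated mutation carriers and tissue alone about 94%, which is consistent with the observation that neither sample type covers every case.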


Role of cfDNA in monitoring clonal evolution 

The early detection of relapse following prima- 
ry treatment for cancer and characterization of 
emerging molecular subsets might offer new ther- 
apies to limit tumor recurrence. ctDNA analysis is a 
potentially powerful method to noninvasively track 
tumor clonal evolution. Results from the TRACERx 
[TRAcking Cancer Evolution through therapy (Rx)] 
lung cancer study showed that phylogenetic cfDNA 
profiling closely tracks the subclonal nature of lung 
cancer relapses and metastases. It also found that 
the tumor volume was correlated with the variant 
allele frequency determined from plasma cfDNA. 
cfDNA profiling can allow for tracking of tumor 
evolutionary dynamics and detection of adjuvant 
chemotherapy resistance in NSCLC (5). Recently, 
Jiang et al. reported that the mutational landscape 
of cfDNA is associated with therapeutic response 
to first-line, platinum-based doublet chemotherapy 
in patients with advanced NSCLC (16). 

The CRC patients with wild-type RAS could be 
stratified based on the tumor tissue genotype 
(17). However, RAS mutations often occur after 
EGFR blockade treatment, and it is challenging to 
obtain repeated tumor biopsy tissue to track tumor 
evolution. Siravegna et al. observed RAS clonal 
evolution in circulating cfDNA of mCRC patients. 
Interestingly, they found that the abundance of 
mutated KRAS clones in plasma increased during 
EGFR blockade treatment, but decreased following 
withdrawal of EGFR-specific antibodies (4). 


cfDNA in immune checkpoint therapies 

Recent clinical data support the use of new 
immune checkpoint inhibitors in several types of 
cancer. These drugs interfere with proteins that 
prevent the immune system from attacking cancer 
cells, and include such drugs as anti-PD-1 and anti- 
PD-L1 antibodies (18-22). A number of biomarkers 
have been developed to stratify patients likely to 
benefit from these immune checkpoint blockers. 
Their use has improved the survival of cancer 
patients and reduced treatment costs. Among 
these biomarkers, the PD-L1 protein is the most 
well known. Numerous clinical trials suggest that 
patients expressing PD-L1 are more likely to benefit 
from treatment with PD-1/PD-L1 inhibitors (23-25). 
However, these studies showed that approximately 
10% of PD-L1-negative patients can also benefit 
from PD-1/PD-L1 inhibitors, suggesting that there 
might be other biomarkers that can better predict 
drug efficacy (23-25). 

Several studies have reported other biomarkers 
that can also predict drug efficacy. A deficiency 
in DNA mismatch repair (dMMR) is a classical 
biomarker of colorectal and other cancers, and is 
associated with the efficacy of chemotherapy. High 
microsatellite instability (MSI-H) or dMMR leads to 
the accumulation of genetic mutations that may 
increase the likelihood of immune system identification 
and destruction of the tumor, suggesting that 
patients with dMMR progressive tumors can 
benefit from treatment with pembrolizumab or 
other PD-1 inhibitors (20, 26). Recent studies 
identified the mechanisms of acquired immune 
resistance after anti-PD-1 monoclonal antibody 
treatment and found two specific genetic variants 
in JAK1/JAK2 and beta-2-microglobulin to be 
involved (27, 28). A good response to immune 
checkpoint inhibitor treatment was positively 
associated with significantly increased tumor- 
infiltrating CD8 T cells or T-cell receptor (TCR) 
diversity (29, 30). In these studies, comparison of 
TCR clonality at baseline and postdosing biopsies 
showed that samples from responders had over 
10 times as many clones expand after anti-PD-1 
therapy; peripheral blood TCR diversity increased 
in responders after anti-CTLA-4 therapy. Peripheral 
blood T cells are relatively easy to obtain from 
cancer patients, so TCR diversity testing might be a 
good predictor of response to immune checkpoint 
inhibitors. In NSCLC, the amount of tumor variant 
load or tumor mutational burden (TMB) can impact 
the efficacy of immune checkpoint inhibitors (such 
as anti-PD-1 antibody) (31-36). 

Biomarkers such as those mentioned above, 
which can predict the efficacy and prognosis of 
immune checkpoint inhibitor treatments, are found 
in many tumor tissues. It is well known that the 
predictive efficacy of biomarkers from tumor tissue 
is limited because of the difficulty of obtaining 
samples from advanced-stage cancer patients 
and because of tumor heterogeneity (37). Clinical 
studies have shown that cfDNA can effectively 
reflect the tumor’s size, malignant state, and 
ability to metastasize. It can also provide real-time 
information on causative mutations, providing 
more comprehensive gene mutation data than 
can be derived directly from tumor tissues (38). 
Researchers at the 2017 American Society of 
Clinical Oncology conference reported that TMB 
data obtained from sequencing of cfDNA could 
effectively predict those lung cancer patients 
who would benefit most from anti-PD-1 antibody 
treatment (39). In this study, ctDNA TMB was 
associated with a significantly higher number of 
DNA repair mutations. Additionally, smoking was 
associated with a higher TMB score. However, 
lower TMB detected in ctDNA predicted that 
a subset of patients would respond better to 
checkpoint inhibitors, in contrast to results from 
other studies (31-36). Potential reasons for this 
result include the small sample size, the possibility 
that ctDNA does not accurately reflect tumor 
burden, and the limited length of DNA sequenced 
(78,000 bp-138,000 bp). Larger prospective 
studies are necessary to validate these findings. 
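
Simple arithmetic shows why a short sequenced region limits a TMB estimate. TMB is often normalized as mutations per megabase; that convention is assumed here and is not stated in the text. The function name below is illustrative.

```python
def tmb_per_megabase(mutation_count, sequenced_bp):
    """Normalize a raw mutation count to mutations per megabase (Mb)."""
    if sequenced_bp <= 0:
        raise ValueError("sequenced_bp must be positive")
    return mutation_count / (sequenced_bp / 1_000_000)


# Sequenced lengths quoted in the text (bp).
for panel_bp in (78_000, 138_000):
    step = tmb_per_megabase(1, panel_bp)  # change in TMB from one extra mutation
    print(f"{panel_bp} bp: one mutation = {step:.1f} mutations/Mb")
```

At these lengths a single mutation moves the estimate by roughly 7 to 13 mutations/Mb, so the resulting score is coarse and sensitive to one or two calls.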

In the field of commercial gene testing, NGS- 
based MSI-H detection in cfDNA samples has been 
used to guide treatment choices, such as whether 
to use anti-PD-1/PD-L1 antibodies. In China, lung 
cancer, colorectal cancer, and cholangiocarcinoma 
are highly prevalent, providing a good 
opportunity to apply MSI-H and TMB detection 
assays. In addition, since the mechanism of 
tumor development in Chinese cancer patients 
has unique characteristics, research into tumor 
heterogeneity based on cfDNA analysis may 
identify biomarkers more suitable for these 
patients. 

Recently, Ghoneim and colleagues reported 
that de novo DNA methylation plays a role in 
establishing the PD-1 blockade-nonresponsive 
state of CD8 T-cell exhaustion, and that 
administration of DNA-demethylating agents (such 
as decitabine) prior to PD-1 blockade therapy 
may enhance the reinvigoration of antitumor 
CD8 T cells. This suggests that combining a PD-1 
blockade drug with a DNA-demethylating agent 
may improve the therapeutic response (40). 

In evaluating the efficacy of immunotherapy, it 
is worth noting that changes in cfDNA may occur 
earlier than changes in tumor size. Radiological 
evaluation of the efficacy of immune checkpoint 
inhibitors during therapy is therefore limited, and 
delayed clinical responses have been observed 
in some patients (18, 19). These results suggest 
that conventional radiological imaging may not 
reliably predict overall survival [...] 
free survival and overall survival in patients with 
lung cancer, uveal melanoma, and some forms 
of colorectal cancer following treatment with 
nivolumab or pembrolizumab (41). 

Some Chinese companies, including Innovent 
Biologics (Suzhou) Co., Ltd., Jiangsu Hengrui Medicine 
Co., Ltd., and Shanghai Junshi Biosciences Co., 
Ltd., are developing new PD-1/PD-L1 inhibitors that 
are currently in clinical trials. The authors believe 
that the biomarkers described above would be 
very helpful in these trials. 


TABLE 1. Summary of clinical trials using ctDNA currently underway in China. Data from ClinicalTrials.gov 
(https://www.clinicaltrials.gov). 
[Table body garbled in extraction. Column headings: region; recruitment status; cancer type (number of studies). Legible status entries: recruiting; active, not recruiting (2); enrolling by invitation (1); completed (2); status unknown. Legible cancer-type entries: nasopharyngeal cancer (1); lung cancer (1), on the Hong Kong row; lung cancer (4); hepatocellular carcinoma (4); lymphoma (2); breast cancer (1); head and neck cancer (1); colon cancer (1); glioma (1); others (2).] 


Role of cfDNA in T-cell transfer immunotherapy 
T-cell immunotherapy is a potentially life-saving 
treatment for patients with advanced stage tumors. 

As with immune checkpoint inhibitor treatments, 
there are still no good biomarkers that can identify 
responders or predict patient response. Since 
changes in ctDNA can be used to monitor the 
course of a disease, treatment responses, and 
recurrence (42-45), ctDNA levels might be useful 
for monitoring efficacy of T-cell immunotherapy. 


Institutions (in table order): Peking University People's Hospital; Sun Yat-sen University; Peking University; Peking Union Medical College Hospital; Beijing Cancer Hospital. 

Research descriptions (in table order; the row-to-institution alignment is not recoverable from the extracted text): 
Comparison of ctDNA and tumor tissue DNA by targeted sequencing in non-small cell lung cancer (NSCLC) 
Urinary ctDNA detection in NSCLC: a prospective study 
ctDNA detection in surveillance of surgical lung cancer patients 
Dynamic changes of ctDNA in surgical lung cancer patients 
ctDNA dynamic monitoring and its prognostic role in stage 1 NSCLC by NGS 
A prospective, observational trial of the diagnostic and prognostic uses of lung neoplasms and meningeal carcinomatosis 
ctDNA for the prediction of relapse in gastric cancer 
Detecting cell-free DNA in lung cancer patients 
Liquid biopsy in monitoring the therapeutic efficacy of targeted therapy in advanced/metastatic gastric cancer 
PD-1 knockout engineered T cells for castration-resistant prostate cancer 
PD-1 knockout engineered T cells for muscle-invasive bladder cancer 
Clinical application of ctDNA in operable breast cancer patients 
Effectiveness of circulating DNA for predicting the relapse and overall survival in NHL patients 
Circulating cell-free DNA as a predictive biomarker for hepatocellular carcinoma 
Blood detection of EGFR mutation for Iressa (gefitinib) treatment 
Study of small doses of etoposide as maintenance treatment in small cell lung cancer (SCLC) 

TABLE 2. Leading research institutions doing clinical studies of circulating tumor DNA (ctDNA) on the Chinese mainland. 
Data from ClinicalTrials.gov (https://www.clinicaltrials.gov). NHL, non-Hodgkin's lymphoma; EGFR, epidermal growth factor receptor; 
NGS, next-generation sequencing. 

A recent study showed that malignant melanoma 
patients who showed an early peak in ctDNA, but 
then cleared their serum of BRAF V600E ctDNA, 
were highly likely to show a complete treatment 
response over the next one to two years. This result 
may be instructive for other kinds of T-cell transfer 
immunotherapy approaches (46). 

In China, dendritic cell/cytokine-induced 
killer-cell therapy is still the most common T-cell 
immunotherapy, used mainly for the treatment 
of blood malignancies (47, 48), but its efficacy 
for treatment of solid tumors has yet to be 
demonstrated. There is an urgent need for 
appropriate biomarkers to predict efficacy and to 
screen patients to find those potentially responsive 
to T-cell immunotherapy. cfDNA- and ctDNA- 
related biomarkers might be good options to 
explore. 


Clinical trials of cfDNA in China 

Not surprisingly, clinical studies of ctDNA in 
precision cancer research are growing in number 
and importance both in China and worldwide. As 
shown in Table 1, there are currently 59 clinical trials 
involving ctDNA in China, including the Chinese 
mainland (41 studies), Hong Kong (1 study), and 
Taiwan (17 studies). Only 6 have been completed, 
while most are recruiting or preparing to recruit 
patients. Twenty-nine clinical trials involve lung 
cancer patients (particularly NSCLC), accounting 
for half of all studies underway, no doubt because 
lung cancer causes more deaths worldwide than 
any other carcinoma. Certain cancers such as 
hepatocellular carcinoma and nasopharyngeal 
cancer get more attention in China than in the United 
States or Europe, because of higher domestic 
incidence. Most sponsors of Chinese clinical 
trials involving ctDNA are domestic universities, 
hospitals, or companies, with the exception 
of the multinational pharmaceutical company 
AstraZeneca. Leading research institutions 
conducting clinical studies of ctDNA on the Chinese 
mainland are shown in Table 2. 
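
The trial counts quoted in this section can be cross-checked with a few lines of arithmetic. This is only a sketch using the figures stated in the text; the variable names are illustrative.

```python
# Trial counts by region, as quoted in the text.
trials_by_region = {"Chinese mainland": 41, "Hong Kong": 1, "Taiwan": 17}
total_trials = sum(trials_by_region.values())

lung_cancer_trials = 29
completed_trials = 6

lung_share = lung_cancer_trials / total_trials
completed_share = completed_trials / total_trials

print(total_trials)                 # 59
print(f"{lung_share:.1%}")          # 49.2%
print(f"{completed_share:.1%}")     # 10.2%
```

The regional counts sum to the stated 59 trials, lung cancer accounts for just under half of them, and about one in ten has been completed.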


Future directions: cfDNA studies 
in the real world 

The application of cfDNA in the management 
of cancer diagnosis and treatment in China faces 
some challenges. First, it is important that cfDNA- 
related clinical data be gathered from Chinese 
patients, as data from other populations might 
not translate well to the local population. Studies 
should cover the entire process, from diagnosis 
through treatment, including early diagnosis; 
identification of tumor tissue sources; choice of 
therapy, whether chemotherapy, radiotherapy, 
targeted therapy, or immunotherapy following 
surgery; and decisions about optimization of 
treatment options. Complete clinical and follow- 
up data should be collected within a researcher- 
centered, normative multicenter clinical study, 
following international guidelines for NGS. A guide 
to the use of NGS with cfDNA samples must also 
be established so that results from these studies 
are comparable, accurate, reproducible, and can 
be easily applied in the clinic. Finally, government 
and industry should invest substantially to carry 
out this research and help to establish a model 
for collecting large amounts of data, including 
clinical data on cfDNA use, early cancer screening, 
treatment options, guidance on personalized 
treatments, prognosis, monitoring of drug 
resistance, real-time treatment adjustment, and 
monitoring of small lesions that may develop 
during treatment. 


Next-generation 
sequencing-based testing 
for cancer precision 
medicine in China: 
A review of technologies 
and validation procedures 


Weifeng Wang, Weiwei Shi, 
Ming Yao, and Kai Wang 


With the advent of next-generation sequencing 
(NGS), which is much faster than traditional 
sequencing methods, our understanding of cancer 
genomics has grown rapidly and is revolutionizing 
our ability to treat and manage cancers with 
precision medicine. NGS has proven to be a reliable 
and precise diagnostic technology in developed 
countries. But in China it still faces challenges in the 
form of quality control and assurance problems, 
bioinformatics analysis pipelines, clinical 
annotation, and the broad selection of competing 
NGS platforms. Here, we review NGS-based testing 
in China, looking at the challenges it faces, 
and its clinical utility in precision medicine for 
Chinese cancer patients now and in the future. 


belief in cancer biology is that genetic alterations 
spur cancer initiation, evolution, and progression. 
However, early attempts at understanding 
these alterations were hindered by the cost and 
inefficiency of available technologies, such as Sanger 
sequencing. NGS-based testing, in contrast, can 
detect all classes of genomic alterations in a single 
test, and at a lower per-base cost. The alterations 
detected include base substitutions, long or short 
insertions and deletions (indels), gene copy number 
variations, and gene fusions/rearrangements. Such 
comprehensive profiling provides a detailed genetic 
picture of a cancer patient. For instance, researchers 
have been unraveling the various mutation forms 
of the epidermal growth factor receptor (EGFR) 
gene, known to play a role in a number of different 
cancer types. In the past 15 years, researchers have 
developed therapies that target EGFR mutations, 
including base substitutions (1, 2), short insertions 
(3), duplications (4), and fusions (5) (Figure 1). 
Detection of such complex mutation profiles in 
individual patients would have been impossible 
without NGS-based testing. The seminal discovery 
about 10 years ago of the correlation between 
EGFR mutations and the clinical response to drugs 
called EGFR tyrosine kinase inhibitors (TKIs) in 
non-small cell lung cancers (NSCLC) ignited the pursuit 
of precision medicine, which tailors individualized 
therapies according to the genetic and cellular 
characteristics of the patient. Thanks to its technical 
accuracy and clinical utility, NGS-based testing has 
become the primary method of informing precision 
cancer therapy. 

FIGURE 1. Timeline for the past 15 years of detecting EGFR genomic alterations and 
corresponding targeted therapies. 

In addition to its clinical utility, NGS-based testing 
advances the identification of those genomic 
changes that drive cancer, as well as the even larger 
number of passenger mutations that do not provide 
a growth advantage but simply “ride along,” 
significantly expanding our knowledge of the causes of 
cancer. With worldwide efforts from such projects as 
The Cancer Genome Atlas (TCGA) in 2005 and the 
International Cancer Genome Consortium (ICGC) in 
2008, as well as vigorous bioinformatics data 
analyses, researchers are constructing a complete 
mutational atlas of all cancer types and their associated 
alterations. In the treatment of NSCLC, for example, 
analysis of the EGFR and anaplastic lymphoma kinase 
(ALK) genes has become part of routine testing 
to help doctors choose the appropriate targeted 
therapies in first-line treatment, substantially 
improving patient survival and quality of life. In addition 
to their proven value in targeted therapy, NGS-based 
diagnosis panels are finding a place in the rapidly 
developing field of immunotherapy by identifying those 
patients who are most likely to benefit from the 
PD-1/PD-L1 inhibitor treatments. Tumor mutational 
burden (TMB), the total number of mutations present 
in a tumor specimen, is becoming an important metric 
for predicting a patient's response to immunotherapy (6). 
Tumors that harbor defects in the core DNA mismatch 
repair (MMR) protein complex are classified as being 
deficient in MMR and as microsatellite instability-high 
(dMMR/MSI-H) (7). In June 2017, the U.S. Food and Drug 
Administration (FDA) granted accelerated approval to 
pembrolizumab, a drug targeting adult and pediatric 
patients identified as MSI-H or dMMR. This was the first 
time the agency had approved a cancer treatment based on 
a common biomarker rather than the tissue type in 
which the tumor originated. In light of these 
developments, comprehensive NGS-based gene profiling 
has the potential to act as a versatile tool for 
detecting not only gene alterations for targeted therapy, 
but also predictive markers associated with 
response rates to immunotherapy. This suggests that a 
well-designed combination of targeted therapy and 
immunotherapy based on genomic profiles could 
provide an impetus for new clinical trials to explore 
the relationship between the genomic data and the 
therapies. 

With the anticipated approval of additional drugs, 
the U.S.-based National Comprehensive Cancer 
Network (NCCN) has been updating its list of 
recommendations for actionable gene testing in the 
treatment of NSCLC at an unprecedented pace 
(Figure 2). For example, a structural gene variant called 
“MET exon 14 skipping,” which is caused by a 
defective splicing of the gene’s RNA transcript, was added 
to the NCCN guideline on NSCLC shortly after the 
publication of papers that showed significant clinical 
benefits from MET inhibitors to patients with such 
mutations (8). In practice, NGS-based diagnosis can 
detect all of the NCCN-recommended targets, while 
having the flexibility to incorporate screening of 
many more mutations as they become known. 
Larotrectinib, the first small-molecule inhibitor specifically 
targeting oncogenic chimeric proteins expressed 
from fusion genes containing neurotrophic tyrosine 
receptor kinase (NTRK) plus other gene partners, 
has demonstrated consistent and durable antitumor 
activity in clinical trials of both adults and children 
with 17 different cancers. It is thus expected that 
the detection of NTRK fusions, TMB, or MSI-H using 
NGS panels will probably be approved 
as companion diagnostic tests in the near future 
(Figure 2). All of this will allow clinicians to identify 
more cancer patients with unique deleterious gene 
variants and help them determine therapeutic 
options quickly and cost-effectively. 


 | ARMS PCR | Digital PCR | FISH | IHC | Sanger sequencing | NGS (target sequencing) 
Sample usage | Inefficient | Inefficient | Inefficient | Inefficient | Inefficient | Efficient 
Throughput | Low | Low | Low | Low | Low | High 
Sensitivity of MAF | 0.1%-1% | 0.1%-0.01% | 1% | 1%-5% | 10% | 1% (depends on sequencing coverage) 
Variation type | Known SNV and indel | Known SNV and indel | Known CNV and fusion | Protein expression | Known and unknown SNV, indel, fusion | Known and unknown SNV, CNV, indel, fusion 
New variant discovery | No | No | No | No | Yes | Yes 

TABLE 1. Comparison of detection techniques in clinical application.* ARMS, amplification-refractory mutation system; FISH, fluorescence in situ 
hybridization; IHC, immunohistochemistry; SNV, single-nucleotide variant; indel, insertion or deletion; CNV, copy number variant; MAF, mutant 
allele frequency. *Some special protocols not included. MAF sensitivity of FISH and IHC influenced by subjective interpretations. 

NGS offers a single universal test that identifies 
predictive biomarkers for both targeted and immu- 
notherapies. The high-throughput capacity of NGS- 
based testing is evident when it is compared with 
traditional molecular assays, such as fluorescence 
in situ hybridization (FISH) and polymerase chain 
reaction (PCR). FISH primarily detects amplifications 
and rearrangements of known genes, while PCR 
merely detects variations of a single nucleotide and 
known insertions or deletions (Table 1). So far, NGS 
is the best diagnostic technology for examining all 
types of genomic alterations in a single experiment, 
regardless of prior knowledge, and at much lower 
per-base cost than traditional techniques. 
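
Table 1 notes that the MAF sensitivity of NGS depends on sequencing coverage. A simple binomial model illustrates why. The model itself, the threshold of five supporting reads, and the function name are illustrative assumptions rather than anything specified in the text, and the sketch ignores sequencing error.

```python
from math import comb


def detection_probability(depth, maf, min_alt_reads=5):
    """P(at least min_alt_reads variant reads) when each of `depth` reads
    independently carries the variant with probability `maf`."""
    p_below = sum(
        comb(depth, k) * maf**k * (1 - maf) ** (depth - k)
        for k in range(min_alt_reads)
    )
    return 1 - p_below


# Detection probability for a 1% MAF variant at several sequencing depths.
for depth in (100, 500, 1000, 5000):
    print(depth, round(detection_probability(depth, maf=0.01), 3))
```

Under this model, the chance of seeing at least five variant reads for a 1% MAF variant rises steeply between 100x and 1,000x depth, which is why a quoted sensitivity is only meaningful alongside the coverage used.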


Challenges facing NGS adoption 
Cancer is a major public health problem in China. 
According to a recent report, it is the leading cause 
of death in the country, with new patients diag- 
nosed at a rate as high as 12,000 cases each day 
(9). As a developing country with the world’s largest 
population of patients, China has a pressing need 
to provide precise and efficient therapies. Targeted 
therapy has been increasingly used in recent years 
in addition to traditional chemotherapy. Unfortu- 
nately, the genetic characteristics of tumors can 
vary widely, which means that not all tumors have 
the same therapeutic targets. Therefore, precise 
matching of treatments to each patient's clinically 
actionable targets will be crucial to the success 
of precision cancer medicine. This is of particular 
importance in developing countries like China 
where the cost of therapies is a big concern. Many 
of the top hospitals and cancer centers in the United 
States have established NGS screening platforms 
for cancer patients. In China, clinical molecular di- 
agnostic laboratories are also adopting NGS-based 
diagnostics. The technology is at the epicenter of 
a paradigm shift, and has attracted a tremendous 
amount of attention from doctors, scientists, and 
entrepreneurs, which, in turn, is catalyzing its refine- 
ment and growth. 

Number of genes examined | Advantages | Disadvantages | Price 
1-3 | Well-recognized and studied genes; traditional gene testing methods used for years | More samples needed; may miss important treatment options | $ 
<10 | Well-recognized genes in certain cancer types | May miss critical treatment options | $ 
~40 | Covers all approved drugs in China | May miss novel treatment options | $ 
~100 | Currently approved drugs; off-label usage | Not reliable on CNV or TMB | $$ 
~500 | Covers all cancer-related genes to offer potential novel treatments, especially for relapsed patients | Relatively expensive | $$ 
Whole-exome sequencing | Covers all human genes. Offers accurate TMB determination and better CNV calling | Expensive with the risk of missing important treatment options due to missing actionable gene mutations caused by low coverage or gene fusions | $$$$ 

TABLE 2. Comparison of sequencing panels available in China that analyze different numbers of genes.* CNV, copy number variant; TMB, tumor 
mutational burden. *NGS on capture-based Illumina platforms only. 


NGS-panel-based diagnosis is still a nascent tech- 
nology in China and poses many specific challeng- 
es, both in terms of techniques used and manage- 
ment. One issue is that not all Chinese diagnostics 
providers can ensure the quality of their products. 
When facing fierce market competition and loose 
regulations, many companies fail to meet basic 
quality control standards. In a nationwide external 
quality assessment carried out in 2015, only 27% 
of participating laboratories were able to detect all 
known variants in standard samples provided by the 
National Center for Clinical Laboratories of the Chi- 
nese Ministry of Health. This prompted the Chinese 
Society of Clinical Oncology and the China Action- 
able Genome Consortium to release a consensus 
statement on the application of NGS technology
to precision cancer medicine. Similarly, in 2017, the
U.S. Association for Molecular Pathology and the 
College of American Pathologists published a joint 
recommendation on guidelines for validating NGS-based
cancer sequencing screening panels (10). As
expected, both sets of recommendations empha- 
sized the importance of standard NGS operating 
procedures and reliable NGS equipment to guar- 
antee high-quality genetic information from cancer 
samples. Stringent standard operating procedures 
and quality control checkpoints should be imposed 
at each step of the NGS-based testing pipeline. This 
includes sample preparation, DNA library prepa- 
ration, sequencing, data analysis, annotation of re- 
sults, and reporting of results. Additional attention 
should be paid to the criteria by which NGS panel 
tests are judged to be valid. These include types 
and number of samples, reproducibility and repeat- 
ability of variant detection, reportable range (region 
of the genome with acceptable quality results), 
reference range (the spectrum of nonpathogenic 
base changes observed in a population), limits of 
detection, interfering substances, clinical sensitivity
and specificity, and if appropriate, validation of
bioinformatics pipelines and other parameters (11).
As emphasized by OrigiMed, a Shanghai-based 
diagnostics service provider, NGS-based diagno- 
sis in China should have meticulously designed, 
well-tested platforms run by trained personnel, and 
backed up by adequate infrastructure. Another 
recommendation was that annotations explaining 
genomic alterations and potentially actionable sites 
or signaling pathways should be clearly described 
for each gene. 

The depth and breadth of sequencing coverage 
are two important technical factors affecting the 
quality and cost of NGS-based testing (Table 2). 
The numbers of target genes analyzed can differ 
substantially depending on the type of screening 
used, with the breadth of sequencing ranging 
from just a few variants to a few hundred using 
sequencing panels, to the entire coding gene 
region [whole-exome sequencing (WES)] or even 
to the entire genome [whole-genome sequencing 
(WGS)]. The selection of which genes to screen 
for in clinical applications more or less obeys the 
“Goldilocks principle,” namely that a small number 
of genes may generate an insufficient number of 
clinical clues, while a large number of genes could 
produce too much information. Specifically, the 
excessive genomic information generated by large 
sequencing ventures provides limited additional 
clinical value while posing the huge challenge of 
having to interpret incidental findings of variants of 
unknown or uncertain significance. Furthermore,
sequencing too wide a DNA region drives up
expense due to the need to sequence to sufficient
depth for the data to be valid, and can lead to
poor sensitivity if depth of coverage is insufficient.
Shallower sequencing coverage costs less, but
significantly decreases detection sensitivity and may
ultimately result in data that is difficult to interpret.

If sequencing coverage is less than 300x, only 84% 
of low-frequency single-nucleotide variants (<5% 
mutant allele frequency) are detected, and the rate 
of detection for short variants is even poorer (12).
Previous publications have shown that druggable 
mutations with low mutant allele frequency cannot 
be detected by low-coverage WGS/WES, or by 
sequencing panels, and that this may affect treatment 
options. On the other hand, most gene fusions, such 
as those found in the ALK gene, cannot be detected 
by WES alone unless the breakpoints are within or in 
close proximity to exons, and may require WGS. 
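
The link between depth of coverage and the detection of low-frequency variants discussed in this paragraph can be illustrated with a simple binomial sampling model. The sketch below is only an illustration under stated assumptions: reads are treated as independent draws, sequencing error is ignored, and the calling threshold of five variant-supporting reads is a hypothetical value rather than one taken from the cited study. The function name is likewise illustrative.

```python
from math import comb


def detection_probability(depth: int, allele_fraction: float, min_alt_reads: int) -> float:
    """Probability of seeing at least `min_alt_reads` variant-supporting reads.

    Assumes each of the `depth` reads independently carries the variant with
    probability `allele_fraction` (binomial model, no sequencing error).
    """
    # Probability of observing fewer than the required number of variant reads.
    p_below = sum(
        comb(depth, k) * allele_fraction**k * (1 - allele_fraction) ** (depth - k)
        for k in range(min_alt_reads)
    )
    return 1.0 - p_below


# Hypothetical scenario: a variant at 5% mutant allele frequency,
# called only when at least 5 reads support it.
for depth in (100, 300, 1000):
    print(depth, round(detection_probability(depth, 0.05, 5), 4))
```

Under this simplified model, the chance of meeting a fixed read threshold rises steeply with depth, which is consistent with the point that low-frequency variants are easily missed at low coverage. Real assays must also separate true variants from sequencing errors, so observed sensitivity will differ from these idealized figures.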

A small NGS panel offers an affordable way to 
sequence fewer than 10 genes containing mutations 
common in certain cancer types, but risks missing 
several critical treatment options due to incomplete 
gene profiling (Table 2). The incorporation of more 
genes into the panel improves the quality of ge- 
nomic mutation profiling and increases the number 
of treatment options, but it still fails to reflect the 
overall TMB, which is an important predictor of how 
effective immunotherapy will be. In order to maximize
a patient's treatment options, an even larger
panel consisting of several hundred relevant genes 
is recommended. However, this is not always avail- 
able in China. The leading NGS panel-based prod- 
uct in the Chinese market, YuanSu from OrigiMed, 
sequences 450 well-defined cancer genes known 
to be important for targeted therapy, and analyzes 
TMB, microsatellite instability, and potential resis- 
tance to immunotherapy treatment (also incorporat- 
ing RNA sequencing to detect more gene fusions 
and splice variants). 
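
To make the point about panel size and TMB concrete, the following sketch computes TMB as somatic mutations per megabase of sequenced region. This is a common convention rather than a definition given in this article, and the panel sizes and mutation counts are hypothetical values chosen only for illustration.

```python
def tumor_mutational_burden(somatic_mutations: int, panel_size_bp: int) -> float:
    """Return TMB as mutations per megabase (assumed convention)."""
    if panel_size_bp <= 0:
        raise ValueError("panel_size_bp must be positive")
    return somatic_mutations * 1_000_000 / panel_size_bp


# Hypothetical panels: a small hotspot panel and a large multi-gene panel.
small_panel_bp = 50_000
large_panel_bp = 1_200_000

# A single extra mutation shifts the small-panel estimate far more
# than it shifts the large-panel estimate.
small_shift = tumor_mutational_burden(2, small_panel_bp) - tumor_mutational_burden(1, small_panel_bp)
large_shift = tumor_mutational_burden(13, large_panel_bp) - tumor_mutational_burden(12, large_panel_bp)
print(small_shift, large_shift)
```

Because one additional mutation changes the estimate much more on a small panel, TMB values from small panels are unstable, which is one way to read the statement that smaller panels fail to reflect overall TMB.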


The present and future of NGS-based 
testing in China 

In 2015, the Chinese Ministry of Science and Tech- 
nology held the first nationwide conference of ex- 
perts to devise and promote a strategy for precision 
medicine. One of the primary accomplishments of 
the conference was a pledge from the government 
to invest 60 billion RMB (US$9.24 billion) in the field 
of precision medicine by the year 2030. NGS-based 
technology is one of the key components of this 
investment. 

China is preparing for the expanded use of NGS- 
based technology in precision medicine, especially in 
the area of cancer. The American company Illumina
retains the largest share of the market for sequencing 
hardware, with around three hundred NGS machines 
installed so far; other vendors provide the majority of 
reagents and consumables. But several domestic Chi- 
nese companies have entered the NGS market in re- 
cent years. More than 10 companies have developed 
their own NGS machines, some of which have been 
approved by the China Food and Drug Administra- 
tion. These include the BGISEQ-500, developed by 
BGI, and the BioelectronSeq 4000, made by Capital- 
Bio Corporation. However, domestic companies still 
have a long way to go to reach the high bar set by 
their foreign competitors. In the sequencing services 
and clinical testing segment of the market, the lack 
of good training and supervision means that most of 
the approximately 160 sequencing service compa- 
nies in China's eight major cities fail to meet reliability 
standards for their sequencing data analysis. 

China is still behind the United States in the 
clinical use of NGS-based testing. Most hospitals in 
China remain heavily reliant on traditional molec- 
ular diagnostics like PCR, immunohistochemistry, 
and FISH as the major cancer diagnosis and prog- 
nosis tests. There are three main reasons why this 
situation exists. First, NGS-based testing involves a 
series of procedures—including sample preparation, 
DNA library construction, sequencing, bioinformat- 
ics analyses, and clinical annotation—that each re- 
quire stringent standards of operations and quality 
control and assurance. As discussed above, China 
is not readily set up for these requirements in most 
domestic hospital laboratories. While the business 
of commercial assays has boomed in the past two 
years, only a few companies have the necessary 
analytical and clinical validation for their NGS-based 
assays. And the importance of validation is not yet 
appreciated by every doctor in China. 

Second, numerous targeted cancer therapies 
have been approved by the U.S. FDA, and many 
more are being studied in clinical trials, but expe- 
rience in the appropriate use of the new drugs is 
relatively limited in China. Doctors and patients 
tend to opt for traditional diagnostics like single 
gene-based tests, instead of comprehensive diag- 
nostics like NGS-based assays. Finally, the cost of 
NGS-based testing is still too high for most Chinese 
patients. Generally, the price of a comprehen- 
sive NGS-based assay ranges from US$1,500 to 
US$3,000, depending on the size of the panel assay 
and the quality. 

Although NGS-based cancer testing is still in its 
infancy, it is soon expected to become the major 
component of worldwide clinical NGS diagnostics, 
including in China. It is expected that academia
and industry in China will continue their efforts at 
building sequencing hardware and designing re- 
agents. More importantly, China will be expanding 
the academic and clinical application of NGS-based 
assays with a focus on exploring cancer etiology 
and pathogenesis, and providing a genetic basis 
for personalized therapy with the help of cancer 
genome sequencing, as well as integrating various 
NGS-based technologies to aid ongoing research 
in conventional clinical medicine. The hope is that 
these steps will lead to significant improvements
in precision medicine, including disease diagnosis,
treatment, and clinical decision-making for Chinese 
cancer patients. 


Looking to the future 

NGS-based testing is one of the most exciting 
developments in oncology research and clinical 
application in recent years. It provides a more com- 
prehensive and efficient approach for characteriz- 
ing the genomes of individual cancer patients and 
matching them with treatments for optimal clinical 
management. Most hospitals in China are not fully 
ready to run NGS platforms in-house. But with the 
concerted efforts of the government, researchers, 
and companies, precision cancer medicine in China 
is developing rapidly. More detection technologies 
are emerging for clinical applications, including 
ultrasensitive circulating tumor DNA detection and 
exosome and epigenome profiling. We envision a 
future where each cancer patient will be tested by 
a comprehensive gene panel at several points in 
time and wherever they are receiving treatment, 
with a range of sequencing technologies that will 
detect multilevel molecular abnormalities and help 
clinicians design a personalized, precise therapy 
tailored to their individual needs. 


References 
1. J.G. Paez et al., Science 304, 1497-1500 (2004).
2. T.J. Lynch et al., N. Engl. J. Med. 350, 2129-2139 (2004).
3. H. Greulich et al., PLOS Med. 2, e313 (2005).
4. J.N. Gallant et al., Cancer Discov. 5, 1155-1163 (2015).
5. K. Konduri et al., Cancer Discov. 6, 601-611 (2016).
6. D.P. Carbone et al., N. Engl. J. Med. 376, 2415-2426 (2017).
7. D.T. Le et al., Science 357, 409-413 (2017).
8. G.M. Frampton et al., Cancer Discov. 5, 850-859 (2015).
9. W. Chen et al., CA Cancer J. Clin. 66, 115-132 (2016).
10. L.J. Jennings et al., J. Mol. Diagn. 19, 341-365 (2017).
11. H.L. Rehm, Nat. Rev. Genet. 14, 295-300 (2013).
12. G.M. Frampton et al., Nat. Biotechnol. 31, 1023-1031 (2013).


ctDNA-NGS: The key to 
unlocking a molecular 
diagnostic revolution in 
the heart of Asia 


Ying Hou and Kang Ying 


A wealth of information about a cancer can be
obtained using liquid biopsy, a noninvasive blood 
test. The diagnostic potential of liquid biopsy is best 
illustrated by the development of the companion di- 
agnostic for osimertinib (a third-generation targeted 
therapy for patients with non-small cell lung cancer). 
Osimertinib was approved by the Chinese Food and 
Drug Administration in just seven months, a land- 
mark achievement in lung cancer therapeutics. A 
major contributor to its success was the codevelop- 
ment of a companion assay for detecting the EGFR 
T790M mutation (1-3), which commonly emerges
in lung tumors after treatment with first-generation
therapies. When tissue samples were unavailable, 
mutation detection could be done using liquid biop- 
sy technology, which required only a small sample 
of patient blood. Liquid biopsy testing has been so 
effective in tracking EGFR T790M that the practice
is now strongly recommended by the European
Society for Medical Oncology (4) and in the National 
Comprehensive Cancer Network (5) guidelines. In 
addition, the Chinese Lung Cancer Summit Commit- 
tee formally endorsed the use of liquid biopsy for 
the detection of EGFR T790M in 2016 (6). Although 
the emergence of a tissue-based molecular diag- 
nostic market in Asia has been driven largely by the 
high occurrence of epidermal growth factor receptor 
(EGFR) mutations in the region, EGFR mutation de- 
tection is just the beginning. With a concerted effort, 
liquid biopsy technology has the potential to rede- 
fine the field of clinical molecular diagnostics not just 
in Asia, but worldwide. 


Challenges for Chinese ctDNA-NGS 
clinical diagnostics 

The tremendous potential of liquid biopsy is 
offset by the technological challenges it presents. 
Any practical liquid biopsy diagnostic assay 
based on circulating tumor DNA (ctDNA) must 

| Example | AVENIO ctDNA Expanded Kit | Oncomine cfDNA Lung Cancer Panel | Nebula: Accu-Act | Comet: Accu-Kit |
|---|---|---|---|---|
| Technology | Barcode-based error-correction sequencing | Barcode-based error-correction sequencing | Concatemer-based error-correction sequencing | Concatemer-based error-correction sequencing |
| Panel coverage | 198 kb | ~2 kb | 100 kb | ~2 kb |
| SNV/indel sensitivity | ~10 copies (a) | <10 copies (b) | <10 copies (c) | <10 copies (c) |
| SNV/indel error rate | <0.6% (a) | 0.2-2% (b) | 0.03% (c) | 0.03% (c) |
| Workflow ease-of-use | ++ | +++ | + | +++ |
| Design flexibility | +++ | + | +++ | ++ |

TABLE 1. Comparison of liquid-biopsy ctDNA sequencing technologies. SNV, single nucleotide variant; indel, insertion or deletion; kb, kilobases.
(a) 30-ng input, 0.1% allele frequency (AF) (27). (b) 20-ng input, 0.1% AF (28). (c) AccuraGen analytical data for Firefly technologies. No false positive in
50 noncancerous individual plasma samples and 24 wild-type cfDNA standards. Average of 96% detection rate for 20-ng input at AF of 0.1%.



have an extremely high and readily reproducible 
performance sensitivity and specificity (7, 8). 
ctDNA (fragmented DNA from a tumor that is
free-floating in the peripheral blood) is present
in minuscule quantities (on the order of a few 
nanograms per milliliter of plasma). As a result, 
isolating ctDNA signals amid the noise generated 
by noncancer DNA fragments and sequencing 
errors is difficult. Droplet digital polymerase 
chain reaction (ddPCR) and next-generation DNA 
sequencing (NGS) are among the only quantitative 
technologies able to meet the sensitivity demands of 
low-frequency mutation detection in ctDNA (9, 10).
Between the two, NGS offers several advantages. It 
has greater multiplexing capacity (where multiple 
genes can be analyzed in parallel from the same 
input) and is better at detecting de novo mutations. 
The performance sensitivity and specificity of NGS 
technologies, however, vary significantly (11-13).
CtDNA testing is also not without regulatory 
constraints. Unlike in the United States, medical 
facilities in China perform most clinical testing in- 
house. While this is an effective cost-saving practice 
for established testing protocols, the ctDNA-NGS 
diagnostic assay is difficult to deploy on-site with- 
out highly specialized technicians. As a result, the 
use of ctDNA-NGS testing is carefully monitored by 
the Chinese National Health and Family Planning 
Commission (NHFPC) (14). To date, 26 facilities have 
received government authorization to offer NGS 
testing for the purposes of oncological treatment. 
While the availability of these services is a promising 
first step, a formal assay performance evaluation 
conducted in 2015 by NHFPC revealed an alarming 
variability in results. Of the 64 sites (including the 26
facilities authorized to perform ctDNA-NGS assays),
33 passed the evaluation and among these, only 17
received perfect marks (15). The lack of technical
development and operational protocols at these pi- 
lot sites has resulted in the use of off-site, third-party 
testing services, reducing adoption. Thus, the rapid 
adoption of ctDNA testing in China is contingent 
upon the standardization of testing protocols. 


One solution: hospital-owned ctDNA-NGS 
clinical labs 

One way to promote the use of ctDNA-NGS is by 
providing widely accessible testing guidelines or 
kits that would enable hospitals to confidently per- 
form in-house NGS testing. This “kit-able” solution 
would include such essential features as a “low- 
touch workflow” requiring little or no NGS expertise 
and a self-contained data analysis pipeline. Such a 
kit could then be easily translated into an automat- 
ed technology for informing personalized therapeu- 
tic treatment at scale. 

Although a few kit options are available for tumor 
tissue sequencing, kit options for NGS-based ctDNA 
sequencing are far scarcer. Historically, performance 
comparisons of NGS-based ctDNA kits have high- 
lighted their relative trade-offs. Capture-based assays 
have demonstrated high specificity and multiplexing 
capacity but limited sensitivity and a complex work- 
flow. A single AVENIO capture-based assay from 
Roche (17, 16), for example, can cover >100 kilobases 
and detect single-nucleotide variants, insertion- 
deletions, and fusion alterations at a rate of 96%- 
99%, at allele frequencies of 0.5%-1%. In contrast, 
amplicon-based assays (17) offer high-performance
sensitivity and ease of use (the Thermo Fisher 
Oncomine kit has a turnaround time of just two days),
but provide limited coverage and few opportunities 
for assay customization. Amplicon-based assays also 
tend to produce more false positives. We believe 
that AccuraGen’s technology, Firefly, brings together 
the best of amplicon- and capture-based assays (18).
Firefly can detect a breadth of genetic alterations 
with a sensitivity approaching the physical limit and 
a specificity 10-fold better than benchmark, all while 
maintaining the added benefits of flexible panel design
and workflow integration. It has a consistent error
rate of 1 in 1 million at the molecular level and can
detect variants at an allele frequency of 0.1% in 20-ng
samples of cfDNA at a detection rate greater than 90%.
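
The input amounts and allele frequencies quoted here can be translated into approximate numbers of mutant DNA molecules, which helps explain what a sensitivity "approaching the physical limit" means. The sketch below assumes roughly 3.3 pg of DNA per haploid human genome, a commonly used approximation that is not stated in this article, and models molecule sampling as a Poisson process. The function names are illustrative.

```python
from math import exp

# Assumption: approximate mass of one haploid human genome, in picograms.
HAPLOID_GENOME_MASS_PG = 3.3


def genome_equivalents(input_ng: float) -> float:
    """Approximate number of haploid genome copies in a cfDNA input."""
    return input_ng * 1000.0 / HAPLOID_GENOME_MASS_PG


def expected_mutant_copies(input_ng: float, allele_frequency: float) -> float:
    """Expected number of mutant molecules at a given allele frequency."""
    return genome_equivalents(input_ng) * allele_frequency


def prob_at_least_one_mutant(input_ng: float, allele_frequency: float) -> float:
    """Poisson probability that the sample contains any mutant molecule."""
    return 1.0 - exp(-expected_mutant_copies(input_ng, allele_frequency))


for input_ng in (20, 30):
    copies = expected_mutant_copies(input_ng, 0.001)
    print(input_ng, round(copies, 1), round(prob_at_least_one_mutant(input_ng, 0.001), 4))
```

Under these assumptions, a 20-ng input at 0.1% allele frequency contains on the order of six mutant molecules and a 30-ng input about nine, in line with the "<10 copies" and "~10 copies" entries in Table 1. With so few molecules present, an assay cannot afford to lose many of them during library preparation.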

The backbone of the Firefly technology is its 
use of rolling-circle amplification to empower 
consensus-based concatemer error correction. 
Single-stranded ligation without the usage of any 
barcode results in a high conversion rate, and 
rolling-circle template amplification minimizes the 
proliferation of enzymatic errors common in PCR. Fi- 
nally, tandem repeats in concatemers make mutation 
detection both accurate and economical. 
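
The idea behind consensus-based concatemer error correction can be sketched as a majority vote across the tandem copies contained in a single read. The example below is a simplified illustration and not AccuraGen's actual algorithm: it assumes the repeat unit length is known in advance, that copies are perfectly aligned with no insertions or deletions, and that errors occur independently in each copy. The function names and the toy sequence are hypothetical.

```python
from collections import Counter
from typing import List


def split_concatemer(read: str, unit_length: int) -> List[str]:
    """Cut a concatemer read into consecutive full-length copies of the template."""
    return [
        read[start:start + unit_length]
        for start in range(0, len(read) - unit_length + 1, unit_length)
    ]


def consensus(copies: List[str]) -> str:
    """Take the most common base at each position across all copies."""
    return "".join(
        Counter(column).most_common(1)[0][0]
        for column in zip(*copies)
    )


# Toy read: three tandem copies of an 8-base template.
# The second copy carries a single error (T -> A at position 4 of the unit).
read = "ACGTTGCA" + "ACGATGCA" + "ACGTTGCA"
copies = split_concatemer(read, 8)
print(consensus(copies))
```

An error that appears in only one copy is outvoted, whereas a true mutation present in the original molecule appears in every copy and survives the vote. A production pipeline would also need to locate repeat boundaries and handle indels, which this sketch omits.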


Emerging challenges facing NGS-based 
ctDNA mutation assays 

Although the clinical utility of ctDNA sequencing is 
currently oriented toward targeted therapy selection, 
mounting evidence suggests it may also be used 
to track the progression of cancer (19). Two studies 
monitoring postsurgical residual disease in breast 
(20) and colorectal (21) cancer patients demonstrated
that the predictive value of ctDNA in determining
the likelihood of cancer relapse was far greater than 
estimates made using traditional clinical imaging 
evidence. While these studies foreshadow exciting 
future applications of ctDNA liquid biopsy assays, 
their limited genetic scope and small patient sample 
sizes impede their clinical actionability. Fortunately, 
new tools with increased panel sizes and product ac- 
cessibility have since emerged. Recently, UK-based 
TRACERx [TRAcking Cancer Evolution through 
therapy (Rx)] reported the superiority of a 30-plex 
panel over a single-plex or low-plex panel (up to 4
amplicons) in detecting cancer relapses and tumor 
progression (22, 23). Empowered by Firefly technol- 
ogy, AccuraGen currently offers Accu-Act, a 61-gene 
capture-based panel for residual disease monitoring 
and strategic cancer management. AccuraGen's am- 
plicon-based ctDNA kit, Accu-Kit, which detects only 
genes included in NCCN guidelines, is easy to use, 
customizable, and affordable. 

Increasing panel size, however, creates new challenges
for developing a viable assay. The underlying
genetics of tumor cells vary tremendously both
within a single tumor and from one tumor to another.
How will intra- and intertumoral heterogeneity be
addressed? How can assay results be both highly
sensitive and easily reproduced so they can be
clinically actionable? How will diversified subclone 
mutation profiles be integrated to generate a holistic 
overview of the disease? Although these problems 
are complex, they are not insurmountable. 

A combination of ctDNA-NGS kits and laboratory- 
developed tests will likely address the majority of 
known clinical needs for ctDNA. Though China’s 
NHFPC has yet to publish official guidelines, several 
clinical committee consortiums have released 
their own recommendations to guide independent 
clinical testing (24-26). AccuraGen is uniquely 
positioned to empower NHFPC-certified facilities to 
act on these recommendations by equipping them 
with clinical kits for on-site testing. By making liquid 
biopsy technology more accessible and lowering 
barriers to workflow integration, we believe that 
AccuraGen can help bring cutting-edge research 
to the bedside. 

Although the most effective way to encourage 
widespread liquid biopsy adoption would be CFDA 
approval of a ctDNA-NGS in vitro diagnostic kit, the 
likelihood of this occurring in the near future is low. 
In the short-term, ctDNA-NGS assays will need to 
be performed in-house at NHFPC-approved clinics. 
Those facilities will need the expertise to generate 
high-quality mutation profiles; they already regu- 
larly perform a wide array of diagnostic tests. At 
AccuraGen, we believe that with superior technolo- 
gy, ctDNA-NGS assays can be as easy to use as any 
other tools in the clinic's toolbox. 

Regulatory processes, while slow, are necessary 
safeguards for delivering quality services. But regula- 
tion should not stonewall innovation. New products 
envisioned by teams of scientific experts and sea- 
soned physicians will still be developed. AccuraGen 
is committed to advancing the frontiers of molecular 
diagnostic testing using ctDNA-based NGS technol- 
ogy, and we believe that China is in the vanguard of 
a revolution in this area. 
Adoptive cell transfer
therapy: A strategic 
rethinking of combination 
cancer therapy 


Minghui Zhang 


In recent decades, significant progress has
been made in immunotherapy treatment, which 
is regarded by some as the latest battlefront in 
a long war against cancer. In 1890, the father of 
immunotherapy, American bone surgeon William 
Coley, used a mixture of two dead infectious 
bacteria strains to treat patients with inoperable 
cancers and achieved a complete remission rate of 
10%. German biologist Paul Ehrlich speculated
about the possible existence of immune tumor 
surveillance—the process by which the body’s 
immune system is constantly seeking out and 
destroying tumor cells. As more tumor antigens 
are identified and characterized, we gain a deeper 
understanding of how the immune system fights 
tumors. Recently, several “immune checkpoint 
blockers,” which promote the ability of certain 
cells to attack cancer, have had clinical success. 
Adoptive chimeric antigen receptor (CAR) T-cell 
transfer, a therapy in which genetically modified 
T cells are transferred into a patient [and a type of 
adoptive cell transfer (ACT)], has been approved 
by the U.S. Food and Drug Administration. But the 
efficacy of these treatments needs improvement, 
and researchers are still trying to mitigate their 
adverse side effects. 

Scientists are also looking for ways to develop 
effective therapies that combine immune treatment 
with conventional cancer therapies to increase 
therapeutic benefit, minimize harm, and provide a 
cure or long-term remission. Tumors use multiple 
means to evade detection by the immune system. A 
better understanding of the mechanisms of tumor 
development and the reasons for success or failure 
of current therapeutic strategies is needed for the 
development of more effective treatments. ACT, 
which in essence provides more effective immune 
cells to fight against tumor cells, is playing a key role 
in cancer treatment today. However, there are several
different forms of ACT. Here, we will discuss ways
that ACT might be exploited as a broad-spectrum
therapy and be combined with traditional treatments
to increase its efficacy.

FIGURE 1. Factors that allow for tumor development. MHC, major histocompatibility complex; CTL, cytotoxic T-lymphocyte; Treg, regulatory T cell; Breg, regulatory B cell; NKreg, regulatory natural killer (NK) cell; DCreg, regulatory dendritic cell; MDSC, myeloid-derived suppressor cell; APC, antigen-presenting cell.


Tumor development 

It is well accepted that cancer is a disease 
that develops and progresses because of the 
accumulation of genetic and epigenetic changes 
that affect the way genes are expressed. Genetic 
drivers of tumor formation and growth are 
confirmed by gene sequencing profiles and the 
study of proteins expressed in and on the tumor 
cells. However, deeper understanding of the 
effects of the immune response on tumors is 
leading to the realization that the immune system 
is the most important extrinsic influence on the 
fate of mutant cells: If immunosurveillance works 
correctly, mutant cells will be recognized and 
eradicated (Figure 1), explaining why certain 
people don't get cancer. If the immunosurveillance 
system cannot recognize or eradicate the mutant
cells, these cells will have the opportunity
to develop into a tumor. This principle is 
also expressed as the “three Es” of cancer 
immunoediting: elimination (the immune system 
finds and destroys cancer cells), equilibrium 
(cancer cells that are not destroyed right away may 
exist in a delicate balance between growth and 
control by the immune system), and escape (the 
tumor cells escape immune system surveillance 
and begin growing in an environment in which the 
immune response is suppressed). 

Substantial genetic heterogeneity has been 
detected both in different samples from the 
same kind of primary tumors and metastases, 
and in individual cancers from the same patient. 
Precision cancer medicine in its current form 
is heavily dependent on genetic testing, and 
promotes the molecular characterization of tumors 
in order to develop more effective and beneficial 
drugs for patients carrying certain mutations. 
However, the benefits often don't persist, as the 
tumor cells can develop resistance to drugs by
the upregulation of a partially inhibited pathway,
a mutation of the target gene, or an activation 
of alternative pathways, for example. Many 
patients will experience drug resistance, cancer 
progression, and ensuing metastasis. 

The immune system can recognize mutant cells 
and eliminate many early malignancies. However, 
tumors have a variety of means to evade immune 
attack. They can decrease the expression of major 
histocompatibility complex class I molecules to
escape natural killer (NK)-cell cytotoxicity, and 
increase the expression of coinhibitory molecules 
to inhibit T-cell activation and function. A tumor 
may also establish an immunosuppressive state 
in the tumor microenvironment by producing 
immunosuppressive factors, including indoleamine
2,3-dioxygenase, vascular endothelial growth
factor, and TGF-β, and by recruiting myeloid-derived
suppressor cells and regulatory T cells.
Overcoming these obstacles and strengthening 
the immune response are the keys to successful 
immunotherapy. 


The current state of ACT 

Immunotherapy can be classified into two broad 
categories. The first is nonspecific immune stim- 
ulation, and includes cytokine immunotherapy 
(which acts to stimulate the immune system to 
fight the cancer), oncolytic virus immunothera- 
py (which selectively kills tumor cells while also 
stimulating antitumor immunity), Toll-like receptor 
agonist therapy (which activates both innate and 
adaptive immune responses), and some forms of 
ACT (such as the transfer of natural killer cells into 
the patient), among others. The second category 
is specific immunotherapy, including dendritic cell 
(DC) vaccines, other kinds of ACT (for example, 
the transfer of T cells), and immune checkpoint 
inhibitors. 

Among these approaches, ACT may be the most 
complicated. As alluded to above, it encompasses
both nonspecific and specific immune stimulation.
The cells are produced in one of two ways.
In the first strategy, the doctor may isolate specific
immune cells from a patient's tumor (such as
tumor-infiltrating lymphocytes) or from surround- 
ing blood, culture them in the lab to expand their 
numbers, and then administer them back into the 
patient. In the second strategy, T cells or NK cells 
collected from the patient are engineered with 
new receptors to recognize specific antigens on 
the surface of cancer cells, generating so-called 
CAR-T cells or CAR-NK cells. The recombinant 
cells are then infused back into the patient following
expansion in culture. ACT therefore uses
immune cells to either promote the immune 
response or to destroy the cancer cells that carry 
specific antigens. 

CAR-T is currently the most promising strategy. 
It has demonstrated clinical benefits in some leu- 
kemias (1-4). However, because few specific tumor 
antigens have been identified, the effect of CAR-T 
on solid tumors is still marginal. In addition, the 
side effects of CAR-T can be dangerous or even 
life threatening (4-6). 

Less attention is being paid to nonspecific 
ACT therapies (NK or DC therapy). This may be 
because the effect of these therapies is hard to 
assess using the present evaluation system. The 
transferred cells often trigger inflammation inside 
the tumor, leading to an apparently larger tumor 
in the short term. Another key factor is the timing 
of ACT. Some advanced tumor patients have tried 
various other treatments before they choose cel- 
lular therapy. It is therefore difficult to isolate the 
specific effects of the ACT treatment and evaluate 
what its real clinical benefits might be if used at 
the optimal time. Furthermore, ACT therapy must 
be adjusted depending on the patient's condi- 
tion, the type of cell transferred, the number of 
effective cells, and the frequency of treatment, 
making it complicated to administer. 


The future of ACT 

The complexities of tumors, including their 
heterogeneity and their ability to suppress an im- 
mune response, are at least partly responsible for 
therapeutic failures. There has been great prog- 
ress made with traditional cancer treatments, in- 
cluding surgery, chemotherapy, radiation therapy, 
and targeted therapy. However, these modalities 
often cannot completely eradicate tumor cells. 
Moreover, micrometastases are sometimes already
forming when the tumor is diagnosed. Scientists face
the question of how to build a new strategy that 
preserves the benefits of traditional treatments 
but overcomes their limitations and provides 
long-term remission or even a cure. 

Combining multiple treatments offers a 
potentially powerful approach. The innate and 
adaptive parts of the immune system work 
together to sustain a state of equilibrium in the 
body. If we can understand more deeply the 
antitumor mechanisms of the immune system, 
design ACT therapies accordingly, and use them 
in combination with traditional cancer treatments, 
tumor cells could be eradicated and active 
immunosurveillance reinstated. Thus, in addition
to killing tumor cells, one of the most important
effects of ACT is to reverse a tumor’s ability to
suppress the immune response. A strengthened
immune system can work as a policeman so that
tumor cells that are not killed by traditional cancer
treatment can be either eradicated or controlled
by the immune system. We predict that this type
of combination therapy will lead to the elimination
of many cancers, or at least to longer survival with
no worsening of the symptoms (Figure 2).

FIGURE 2. Combinatorial cancer treatment. DC, dendritic cell; NK, natural killer cell; NKT, natural killer T cell.

In the battle against tumors, surgery can 
be considered the vanguard, decreasing the 
tumor burden and eradicating most tumor cells. 
Chemotherapy, radiation therapy, and targeted 
therapy are the troops sent in to attack inoperable 
cancers. And immunotherapy, especially ACT, 
becomes the rear guard, killing escaped tumor 
cells and creating a microenvironment in which 
metastasis cannot occur. A “special forces” corps— 
for example, more recently identified cell subsets 
that are strongly cytotoxic to tumors—would be a 
major help to the body in its battle against cancer. 
Each patient is different, and winning this battle 
requires a personalized regimen in which ACT
therapy with broad-spectrum antitumor effects
may be a critical factor. 

Following this principle, we treated cancer
patients with personalized combination treatment 
protocols. Based on our finding that genetically 
dissimilar DCs can induce the expansion of a spe- 
cific subset of natural killer T (NKT) cells that are 
more strongly cytotoxic to tumor cells, we carried 
out a clinical trial of ACT with NKT cells taken from 
patients with solid tumors. We used NKT cells 
cultured in vitro to treat patients after surgery to 
decrease the tumor burden, or after chemothera- 
py or radiotherapy to kill most of the tumor cells. 
We observed in this study that many recipients 
of targeted therapy took longer to develop drug 
resistance when their therapy was combined with 
adoptive NKT transfer. The preliminary results of 
the study are quite promising, and some of the 
patients have experienced a complete remission 
of their cancer (Figure 3, next page) (unpublished 
observations). Final results are still pending. 

Our strategy suggests that well-designed, 
personalized combination cancer treatments, 
especially those using adoptive transfer of cells 
FIGURE 3. The clinical benefits of adoptive novel NKT cell transfer in two patients. A gastric cancer patient with lung metastasis (upper
panel) and a pancreatic cancer patient with liver metastasis (lower panel) were treated with adoptive transfer of NKT cells after surgery.
The images show that the tumors in both cases shrink and eventually disappear completely.
faculty position at the Assistant, Associate or Full Professor
level in the Department of Cellular and Molecular
Physiology, Penn State University College of
Medicine. We are seeking Ph.D. and/or M.D. candidates
with outstanding research accomplishments in
structural biology, and strengths in cryo-EM technology.
Further details are given at
https://jobs.sciencecareers.org/job/475106/assistant-associate-or-professor-cellular-and-molecular-physiology/
and applications should be submitted at
https://psu.jobs/job/76573. Applications should
include a cover letter, Curriculum Vitae, and statement 
of research plans, as a single PDF. Penn State is an Equal 
Opportunity, Affirmative Action Employer, and is committed to 
providing employment opportunities to all qualified applicants with- 
out regard to race, color, religion, age, sex, sexual orientation, gender 
identity, national origin, disability or protected veteran status. 




Wake Forest
School of Medicine


The Department of Neurobiology and Anatomy 
at Wake Forest Medical School announces open 
positions for 


Postdoctoral Training in 
Multisensory Processes 


We seek strong candidates for postdoctoral 
training funded by an NIH T32 Training 
Grant. The training program provides a rich 
collaborative research environment that fosters 
interdisciplinary approaches to understanding 
how the brain integrates information from 
multiple senses to produce perception and 
adaptive behavior. Candidates with direct 
experience as well as those in related fields are 
encouraged to apply. Trainees will have access to 
any of 9 laboratories using human subjects and/
or a variety of animal models (rodents to primates)
with approaches spanning molecular/cellular to 
perceptual/behavioral. Fellowships are awarded 
on a competitive basis. 


Applications including a current curriculum vitae 
or nominations should be sent to the Training 
Grant Director: Dr. Barry E. Stein, bestein@wakehealth.edu,
or to its Co-Directors: Dr. Terrence Stanford,
stanford@wakehealth.edu and Dr. Dwayne Godwin,
dgodwin@wakehealth.edu. A description of the faculty
and the program can be accessed via the
website: http://graduate.wfu.edu/admissions/t32/training tpmp.html.
Please also submit your application and CV to job opening 26486 at
www.wakehealth.edu/jobssearch/.
Research Position at ICYS, NIMS, Japan


The International Center for Young Scientists (ICYS) of the 
National Institute for Materials Science (NIMS) is now seeking a 
few researchers. Successful applicants are expected to pursue
innovative research on broad aspects of materials science using
the most advanced facilities at NIMS (http://www.nims.go.jp/eng/
index.html).


In the ICYS, we offer a special environment that enables young
scientists to work independently based on their own ideas and
initiatives. All management and scientific discussions will be
conducted in English. An annual salary of approximately 5.35 million
yen (2017 level) will be offered, depending on qualifications and
experience. An additional research grant of 2 million yen per year
will be supplied to each ICYS researcher. The initial contract term
is two years and may be extended by one more year depending
on the person’s performance.


All applicants must have obtained a PhD degree within the last 
ten years. Applicants should submit an application form including 
a research proposal to be conducted during the ICYS tenure, 
CV Header, CV with list of publications and patents (Be sure to 
attach the header), list of DOI of journal publications following 
our instruction, reprints of three significant publications to ICYS 
Recruitment Desk by March 29, 2018 JST. The application form 
and CV header can be downloaded from our website. Please 
visit our website for more details. 


ICYS Recruitment Desk, 
National Institute for Materials Science 
HEALTH SCIENCE CENTER


NEUROSCIENCE FACULTY POSITION 


The Department of Anatomy and Neurobiology (http://www.uthsc.edu/ 
anatomy-neurobiology/) at the University of Tennessee Health Science 
Center seeks an outstanding researcher in the area of mechanisms of 
neurodegenerative diseases to fill an open rank tenure track position 
(Assistant, Associate, Full Professor). We seek candidates to complement 
and extend departmental and campus-wide research in neurodegenerative 
diseases, especially, but not limited to, traumatic brain injury, chronic 
traumatic encephalopathy, stroke, Alzheimer’s disease, Parkinson’s 
disease, and amyotrophic lateral sclerosis. Research incorporating
contemporary virally based circuit tracing, optogenetic, and DREADD
approaches is encouraged. Successful candidates are expected to
maintain an independent, extramurally funded research program, and 
contribute to the department’s teaching mission. The department and 
campus have state-of-the-art laboratories, core facilities, and unique 
populations of inbred mouse strains for elucidating the pathogenesis 
and treatment of neurodegenerative diseases (https://www.uthsc.edu/ 
citg/mouse-strain-and-pilot-projects.php). We offer competitive salary 
and start-up packages, and membership in our vibrant interdepartmental 
Neuroscience Institute (http://www.uthsc.edu/neuroscience/). 


Candidates must have a Ph.D. or M.D. and significant extramural funding. 
For best consideration applications should be submitted by April 1, 
2018. Submit curriculum vitae, summary of research interests, and 
contact information for three references, in a single Word or PDF file to 
hsimmers@uthsc.edu. 


The University of Tennessee is an EEO/AA/Title VI/Title IX, Section
504/ADA/ADEA institution in the provision of its education and 
employment programs and services. 
New Jersey Medical School


Chief, Division of Allergy, 
Immunology and Infectious Disease 


The Department of Medicine at Rutgers Robert Wood Johnson 
School of Medicine in New Brunswick, New Jersey, seeks an academic- 
physician for the position of Chief. The desirable candidate should 
have a history of extramural funding in basic or translational research 
and demonstrable leadership qualities. She or he must also have been 
successful in promoting teaching, scholarship, and research, as well as 
developing, managing, and implementing clinical programs. Extensive 
collaboration opportunities are available within Rutgers University’s 
broad range of research programs including microbiology, immunology, 
oncology, pharmacology and global health. A competitive salary with a 
generous startup package is available to support the recruitment of the 
Chief and additional research and clinical faculty members. Candidates 
must have an MD or MD/PhD degree, have achieved the position of
Associate Professor or Professor, be board certified in infectious disease,
and be eligible for licensure in New Jersey. Applicants should submit a
letter of interest and current curriculum vitae to: Steven R. Brant, MD,
Professor of Medicine, Chair of ID Chief Search Committee, via email
at steven.brant@rwjms.rutgers.edu


Rutgers, the State University of New Jersey, is an Equal Opportunity/ 
Affirmative Action employer, and is compliant with the Americans 
with Disabilities Act (ADA). For more information, please visit 
http://recruitment.rutgers.edu/TheRUCommitment.htm. Women and 
minorities are encouraged to apply. 
In praise of slow


I huff and puff my way up the moderate slope. Even by my own abysmal standards, this is a poor
run. In the past hour, I have been overtaken by both an octogenarian and a mum pushing her 
toddlers in a buggy. Yet I am smiling. I am a happy runner, despite my utter mediocrity at this 
sport. But at work, happiness had become elusive. After a relatively relaxed Ph.D. and postdoc, 
I had been thrilled when I landed a tenured job. But as I worked to establish myself as a group 
leader, I began to feel intense pressure to be more competitive and publish more. Recently, as 
I wondered why I felt so discontented at my job, I realized that I could apply some lessons from
running to my research.


Lesson one: In the right race, your weakness can become your strength. For more than 30 years
I equated running with speed and so, lacking the fast-twitch muscles of a sprinter, I chose not
to run at all. But once I finally started running, I learned that not being able to sprint makes
me a better endurance runner. Similarly, I often perceived my inability to focus on a single
research topic as a barrier to success as an academic scientist. But I’ve found that my desire
to branch out to different fields helps me make connections across disciplines and see my work
in new ways, which has led to unexpected

a silver-haired gentleman helped me beat my (lamentable) 5K personal best by whispering,
“Go at your pace, not theirs!” when I got stuck behind slower runners. I now realize that this
would have been excellent advice early in my career. Some Ph.D. students push themselves too
hard and burn out, but I had the opposite problem. I was happy to trundle along at the slowest
pace I could get away with, but it ultimately held me back. Even though I like my
slow-but-steady pace, I still needed to learn to push myself rather than drag my feet.

Lesson three: An honest race is the only race worth running. I invariably finish in the bottom
quartile of the local 5K run. All I would have to do to move up a couple of hundred places is
take a shortcut through the fields. Yet I don’t. Nobody does. Scientists are usually like that,
too, but not always. Principal investigators are pressured to keep their spot in the fast lane,
postdocs are chasing the elusive permanent contract, and students are keen to make their mark.
Several of my publications would have had a much easier ride through the reviewing system had
I been slightly less honest about our findings. The temptation to cheat to get an advantage can
be great. However, and this is something that is often overlooked, an advantage is only useful
if you are, in fact, engaged in a competition. This brings me to the last lesson, which is the
most important of all.

Lesson four: There really is no race. For me, running isn’t about being faster than other
runners. Likewise, my goal in research is not to “beat” my colleagues. Mark Rowlands, a
philosopher, academic, and runner, argues that

to eat more peanut butter or to save money on psychotherapy (although these are strong
motivating factors in my case). I run because doing so offers a glimpse of life’s real value.
I now think this is the secret to being happy in research, too. I don’t do research only to get
invited to conferences, see my name in print, or be promoted. Like running, research is a game
with its own intrinsic value. Playing this game of discovery gives me enough joy to keep me
going.

Do I recommend an academic career in the slow lane? It 
doesn’t work for everybody. Letting go of ambition in aca- 
demia is a bit like leaving your GPS watch at home when 
heading out for a run. Scientists are ambitious; they want 
to be the hares leading the race ahead. But the tortoise’s 
secret is that there is a lot of fun to be had at the back of 
the pack. 


Irene Nobeli is a lecturer at Birkbeck, University of London. 
Send your career story to SciCareerEditor@aaas.org.